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WO 00/18929 . PCT/EP99/07004 

Novel Compounds 

i 

The present invention relates to recombinant heterochimeric paramyxoviridae 
glycoproteins and their expression in eukaryotic cells, particularly in Chinese 
5 Hamster Ovary (CHO) cells. The invention further relates to methods for 

constructing and expressing such heterochimeric proteins, intermediates for use 
therein, methods to optimize the codon usage of the nucleic acid sequences which 
encode such heterochimeric proteins and the use of the recombinant proteins as 
vaccines for the prevention of diseases caused by paramyxoviridae pathogens. 

10 , 

The mumps (MuV), Measles (MV), the parainfluenza type I (PIV1), type II (PIV2) 

and type III (PIV3) and the respiratory syncytial (RSV) virus belong to the 

paramyxoviridae family. The MuV is classified in the rubulavirus subclass, the MV 

is classified in the Morbillivirus subclass, the parainfluenza viruses (PIV1, PIV2 

15 and PIV3) are classified in the paramyxovirus subclass while the RSV is attached to 

the pneumo virus subclass. 

RSV is the most important cause of viral lower respiratory tract disease in infants 
and children. The fusion (F) and the attachment (G) protein which are both viral 
20 surface glycoproteins appear to be of potential value for the development of a 
vaccine against RSV. 

r 

The fusion protein F of RSV contains 574 amino acid residues; amino acids 1 to 21 
correspond to the signal peptide and residues 525 to 549 to the membrane anchor 
25 domain. The molecule presents five potential sites for glycosylation. The F protein 
is synthesized as a 70. kDa precursor (F 0 ) which undergoes proteolytic maturation to 
yield the F, subunit (48 kDa) and F 2 (23 kDa) linked via disulfide bridges. The 
protein F, when injected into animals, leads to the production of neutralizing 
antibodies and may induce cytotoxic lymphocytes (CTLs). 



30 



The attachment or G protein of RSV contains 298 amino acid residues and is 
heavily glycosylated since half of its molecular mass (90 kDa) is contributed by 
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oligosaccharide side chains, chiefly in the form of O-linked sugars. It has been 
shown that the G protein, when injected into animals, provides protection against 
homologous but not heterologous subgroup virus challenge. This protein is 
extremely variable and there is only a stretch of 13 amino acid residues which is 
> conserved in all RSV. 



( 20 



; 



The PIV3 is second to RSV as a major agent of severe viral respiratory tract 
infections in infants. The fusion protein F of PIV3 contains 539 amino acid 
residues; amino acids 1 to 18 correspond to the signal peptide and residues 494 to 

10 516 to the membrane anchor domain. The molecule presents 4 potential sites for 
glycosylation. The F protein is synthesized as a 70 kDa precursor (F 0 ) which 
undergoes proteolytic maturation to yield the Fj (56 kDa) and F 2 (14 kDa) subunits 
linked via disulfide bridges. The protein F ; when injected into animals, leads to the 
production of neutralizing antibodies. The F protein is involved in cell fusion during 

15 viral infection and carries an hemolysin activity. Used alone for immunization, the 
F protein generates an immune response which is insufficient to confer protection 
against a challenge with the virus. Complete protection is only acquired by 
concomitant immunization with the attachment protein HN, another glycoprotein of 
PIV3. 



^ The protein HN carries hemagglutinin and neuraminidase activities. It is composed 
of 572 amino acids; its membrane anchor domain occurs in the N-terminal end of 
the molecule, between amino acid residues 32 and 53. Four potential sites for 
glycosylation have been identified. Injection of protein HN into animals generates 

25 an immune response and neutralizing antibodies. These antibodies however do not 
protect completely against a challenge with the virus. Full protection is obtained 
only by concomitant immunization with the F protein of PIV3 . 

The PIV1 virus was initially isolated from young children suffering from disorders 
30 of the lower respiratory tract. Infection with PIV1 causes the majority of cases of 
croup found for all infections caused by paramyxoviruses. Viral transmission of 
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PIV1 is by person to person contact or by aerosol, although the virus does not 
persist in the environment for long. 

j 

Like PIV2 and PIV3, the PIV1 virus has two surface glycoproteins, the fusion 
5 protein (F) and the attachment protein (HN). These two proteins are the priority 
targets for the development of a subunit vaccine, the properties of which would be 
to ensure protection of children from the very first months of life and to prevent 
reinfection, or at least to prevent the serious complications by restricting viral 
development to the upper respiratory tract where the consequences would be benign 
10 (common cold) . 

It 

PIV2 also affects very young children and. causes the same type of respiratory 
discorders, essentially croup, but of less severity. The PIV2 vims has two surface 
glycoproteins (F and HN), which are potential targets for the development of a 
15 subunit vaccine. 

The measles virus is an extremely contagious agent which establishes itself in the 
epithelial cells of the respiratory tract, the oropharynx or the conjunctiva. The 
infection causes fever, cough, head-cold, conjunctivitis and a characteristic 
20 generalised rash. 

There is no appropriate inactivated vaccine against measles but an effective 
attenuated live vaccine is available and is generally used in combination with the 
attenuated live vaccines against rubella and mumps. This live vaccine protects 

25 against the disease for at least 20 years. The measles virus has two surface 

, > 

glycoproteins, which are potential targets for the development of a subunit vaccine. 
The fusion protein (F) is a 550 amino acid long glycosylated molecule and, as for 
the other paramyxovirus, has to undergo proteolitic cleavage to yield F, and F 2 
subunits that are linked via disulfide bridges. This molecule, which carries a 
30 haemolysin activity, generates an immune protective response when injected into 
animals. The attachment protein (H)„ is a 617 amino acid long glycosylated protein, 
which carries a hemagglutinin activity. This protein leads, when injected into 

-3- 
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animals, to the production of neutralizing antibodies that are able to inhibit 
hemagglutination. This immune response protects the animal against a viral 
challenge. 

J 

5 The mumps virus is a pathogen causing the contagious infantile illness which 
consists of the inflammation of parotid glands. During the incubation period 
following infection, the virus replicates in the respiratory epithelium then 
disseminates into secretary ducts of the parotid glands. Other glands may become 
infected thereafter and numerous cases of meningitis have been reported. Among 
10 complications related to the infection, encephalitis is a serious one, with a mortality 
rate of about 1 %; deafness cases have also been reported. 

A vaccine against mumps is available: it is made of an attenuated live virus, 
produced by culturing infected embryonic chicken cells. The vaccine leads to the 
15 seroconversion in vaccinated individuals and protects against infection in more than 
95% of seronegative persons. The vaccine thus reduces significantly the frequencies 
of complications. 

In a number of cases, however, viral infection is not detected because the effects 
20 remain subclinical. Young children and aged people are most likely to develop 

complications from mumps infection. In view of the inherent risks related to the use 
of attenuated live vaccines, such as the potentiation of the illness upon natural 
surihfection in vaccinated individuals, it is desirable to improve the safety of the 



25 



vaccine, particularly for the groups at risk. 



The fusion protein F of mumps virus contains 538 amino acid residues; amino acids 
1 to 26 correspond to the signal peptide and residues 483 to 512 to the membrane 
anchor domain. The molecule presents 7 potential sites for glycosylation. The F 
protein is synthesized as a 65-74 kDa precursor (F 0 ) which undergoes proteolytic 
30 maturation to yield the F l (58-61 kDa) and F 2 (10-16 kDa) subunits linked via 
disulfide bridges. The protein F is involved in cell fusion during viral infection, 
carries an haemolysin activity and plays a role for viral penetration into cells. It 

■ - ' -4- 
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does not however carry the antibody dependent cellular cytotoxicity (ADCC) as 
observed for another mumps virus glycoprotein, HN. 

The protein HN (molecular weight 74-80 kDa) carries hemagglutinin and 
5 neuraminidase activities which are involved in virus attachment to cells and in the 
disruption of the host cell membranes. Protein HN (attachment protein or 
hemagglutinin-neuraminidase) generates neutralizing antibodies and appears 
important for the development of ADCC. Protein HN is composed of 582 amino 
acids; it carries a N-terminal anchor domain (residues 33 to 52) and 9 potential sites 
10 for glycosylation. 

For the viruses considered above, it appears that concomitant immunization with 
both membrane glycoproteins F and HN, or G in the case of RSV, are required to 
achieve full protection in the animal model. Chimeric proteins containing both the F 
15 and G proteins of RSV, or the F and HN proteins of PIV3 have shown complete 
protection against RSV or PIV3 challenge in cotton rats (Brideau et al, J Gen Virol, 
1989, 70 2637-2644 and Brideau et al, J Gen Virol, 1993, 74, 471-477). 

W093 14207 (Connaught) describes heterochimeric proteins comprising RSV and 
20 PIV3 proteins including F(RSV)xHN(PIV3) and F(PIV3)xG(RSV) hybrids, and 

suggests that such proteins can be expressed from a variety of host cells including 

bacterial, mammalian, insect, yeast and fungal ceils. The.specific examples . 

describe expression in insect Sf9 and High 5 cells and mammalian Vero cells. 

There is no specific disclosure of the use of CHO cells. The use of Sf9 and High 5 
25 cells is also described by Du et al, BIO/TECHNOLOGY 12,1994, 813-818. 

Homa et al (Upjohn), J Gen Virol, 1993, 74, 1995-1999 describes another 
heterochimeric protein, F(RSV)xHN(PIV3) expressed in insect cells using a 
recombinant baculovirus. 



30 



Homochimeric paramyxoviridae glycoproteins have also been described by several 
workers :- 
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WO8905823 (Upjohn) describes RSV FxG and GxF hybrids which can be expressed 
from bacterial, yeast, mammalian and insect cells. Example 7 describes the 
expression of an RSV FxG protein from CHO cells although there are no details of 
5 how successful such expression is. 

WO8910405 (Upjohn) describes PIV3 FxHN and HNxF hybrids which can be 
expressed from bacterial, yeast, mammalian and insect cells. Example 6 describes 
the expression of a PIV3 FxHN protein from CHO cells, however no details are 
10 given quantifying the extent of expression and secretion. 

Lehman et al (Upjohn), J Gen Virol. 1993. 74. 459-469 describes the expression of 
PIV3 FxHN in insect cells using recombinant baculovirus vectors as well as in CHO 



15 



cells. 



WQ9306218 (SmithKline Beecham Biologicals) describes PIV3 FxHN hybrids 
which can be expressed in eukaryotic cells including vaccinia, CHO or Vero cells. 
Example B)2 describes the expression of a Fs*a"xHNa" hybrid in CHO cells and 
indicates that the product was almost evenly distributed between cells and medium. 
20 No details are however given quantifying the extent of expression and secretion. 

WO9425600 (SmithKline Beecham Biologicals) describes MuV FxHN and HNxF 
hybrids which can be expressed in vaccinia, a mammalian cell (such as CHO) or a 
bacterial cell. Examples B) 3 and 4 describe the expression of s + FHNaxFa and 
25 Fs*axHNa in CHO cells however no details are given describing the extent of 
expression and secretion. 

Although this cited art may suggest that homochimeric paramyxoviridae 
glycoproteins can be expressed in a variety of cell lines including CHO cells it has 
30 now been discovered that in fact expression and secretion from CHO cells is not 
always successful and success cannot be predicted. Thus it has now been 
demonstrated that although a RSV FxG hybrid could be successfully expressed and 

-6- 
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secreted in CHO cells, analogous homochimeric hybrids from PIV3 and MuV could 
not in fact be expressed in CHO cells in such manner that they could be purified 
from the supernatant in significant quantities. 

5 Surprisingly, it has now been discovered that heterochiroeric hybrids can be 
successfully expressed and secreted in both CHO and insect cells. 

Accordingly in a first aspect the present invention provides a process for preparing a 
heterochimeric protein or an immunogenic derivative thereof comprising an 
10 immunogenicfTagmentofmefusion(F)proteinofRSV,PIVl,PIV2,PIV3,MV 

or MuV and an immunogenic fragment of the attachment (G.HNorH) protein of 
RSV pivi, PIV2, PIV3, MUV or MV which process comprises expressing 
recombinant DNA encoding the heterochimeric protein or immunogenic derivative 
thereof in CHO cells and recovering the protein. 

15 

By heterochimeric protein is meant one that does not contain a fusion or attachment 
protein from the same pathogen. 

This invention also provides novel heterochimeric proteins not previously described 
20 in WO 93 14207 which can be prepared using the process of the present invention. 

Thus, in a second aspect the present invention provides a heterochimeric protein or 
an immunogenic derivative thereof comprising an immunogenic fragment of the 
. fusion (F) protein of RSV, PIVI , PIV2, PIV3, MV or MuV and an immunogenic 
25 fragment of the attachment (G, HN or H) protein of RSV, PIVI, PIV2, P1V3, MuV 
or MV, with the proviso that where one of the immunogenic fragments is derived 
from RSV F, RSV HN or PIV3 F, PIV3 HN, the other of the immunogenic 
fragments is derived from MuV F, MuV HN. MV F. MV H, PIVI F.PIV1 Htf 
PIV2 F or PIV2 HN. , 



30 



By an immunogenic fragment of the fusion (F) protein of RSV, PIVI, PIV2, PIV3, 
MV or MuV is meant a part of the protein which contains at least one antigenic 



-7- 



DEC 04 2000 18=29 



PP1GE.10 



PCTYEP99/07004 

WO 00/18929 

determinant capable of raising an immune response specific to the F protein of 
RSV. PIV1, PIV2, PIV3, MV or MuV respectively. Included within this definition 
is the full length F protein, preferably however the immunogenic fragment is 
lacking the membrane anchor domain at its C-terminal end. 

5 

By an immunogenic fragment of the attachment protein (G. HN or H) of RSV. 
PIV1, PIV2. PIV3. MuV or MV is meant a part of the protein which contains at 
least one antigenic determinant capable of raising an immune response specific to 
the G protein of RSV, to the HN protein of PIV1 , PIV2, PIV3. MuV or the H 
10 protein of MV respectively . Included within this definition is the full length G or 
HN protein, preferably however the immunogenic fragment is lacking the 
signal/anchor domain at its N-terminal end. 

Preferably the heterochimeric protein is linked via an amino acid in the C-terminal 
15 part of the immunogenic fragment of the F protein of RSV, PIV1, PIV2, PIV3, 
MV or MuV to an amino acid in the N-terminal part of the immunogenic fragment 
of the G protein of RSV, the HN protein of PIV 1 . PIV2, PIV3, MuV or the H 
protein of MV. • n 



20 



25 



Suitably the heterochimeric protein commences at its N-terminal end with a signal 
sequence from the F protein of RSV. PIV1, PIV2, PIV3, MV or MuV. 
Conveniently this will be part of the corresponding immunogenic fragment of the F 
protein of RSV, PIV 1 , PIV2, PIV3, MV or MuV when this fragment is linked via 
its C-terminal end to the N-terminal end of the immunogenic fragment of the G 
protein of RSV, the HN protein of PIV1, PIV2, PIV3, MuV or the H protein of 
MV. 



Alternative signal sequences may also be employed. For example, the 
heterochimeric protein suitably commences at its N-terminal end with a signal 
30 sequence of tissue plasminogen activator (TPA). 



- 8 - 
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In order to enhance the level of expression the heterochimeric protein may further 
comprise a ubiquitin leader sequence which is suitably positioned after any signal 
sequence as hereinbefore described. Preferably the ubiquitin leader sequence is 
linked to the C-terminal end of the signal sequence of TPA. 

5 

Preferably the ubiquitin leader sequence is derived from yeast, for example as 
described in Ecker et al, J.Biological Chemistry, 1988, 264(13). 7715-7719. 

Suitably a cleavage site is positioned between the C-terminal end of the ubiquitin 
10 sequence and the N-terminal end of the immunogenic fragment of the F protein of 
RSV, PIV1, PIV2, PIV3, MV or MuV. 

In order to facilitate chromatographic purification the heterochimeric protein 
suitably comprises a polyhistidine tail, for example as described in Hochuli et al, 

15 BIO/TECHNOLOGY, 1988, 1321-1325. The polyhistidine tail preferably 

comprises from 2 to 6 adjacent histidine residues which is suitably attached at the C 
terminal end of the heterochimeric protein. Preferably a cleavage site is positioned 
between the polyhistidine tail and the C-terminal end of the immunogenic fragment 
of the G protein of RSV, the HN protein of PIV1, PIV2, PIV3, MuV or the H 

20 protein of MV. 

The cleavage site for the ubiquitin sequence and/or the polyhistidine tail may be 
chemical or enzymatic and preferably is an enterokinase cleavage site, for example 
as described in LaVallie et al, BIO-TECHNOLOGY, 1993, 187-193. 

25 

Following expression and purification, treatment with an enterokinase will cleave 
off any ubiquitin and/or polyhistidine sequence releasing the desired heterochimeric 
protein. 

30 Particular heterochimeric proteins of this invention include: 

the F protein of RSV lacking its membrane domain linked at its C-terminal end to 
the HN protein of MuV lacking its signal/anchor domain herein referred to as: 

-9- 
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Fs+a RSVxHNsaMuV, as well as 
( Fs + a PIV3 x HNs a' MuV; 
Fs + a'MuV x GsaRSV; and 
Fs + a MuV x HNs a'PIV3, and 
5 immunogenic derivatives thereof. 

The present invention also provides particular heterochimeric proteins which 

include: 

Fs + a'MuVxHs'aMV; or 

10 Fs + a'RSVxHNsa'PIVl; or 

Fs + aRSVxHNs a PIV2, and 
imunogenic derivatives thereof. 

The present invention also provides heterochimeric proteins comprising RSV and 
15 PIV3 proteins hot specifically disclosed in W093 14207, which advantageously can 
be expressed from CHO cells. 

These are: * 

Fs + a (1-526) RSV x HNs a (70-572) PIV3; 

Fs + a (1-492) PIV3 x Gs a (69-298) RSV; 
20 Fs + a (1-526) RSV x HNsa' (70-572) PIV3 bis; 

Fs + a (1-526) RSV x HNs a' (70-572) PIV3 ent his, and 

sTPA (1-21) UB (1-74) ent Fs a (24-526) x HN s a (70-572) PIV3, and 

immunogenic derivatives thereof. 

25 The heterochimeric proteins of the present invention are immunogenic. The term 
immunogenic derivative as used herein encompasses any molecule which is a 
heterochimeric polypeptide which is immunologically reactive with antibodies raised 
to the heterochimeric protein of the present invention or parts thereof or with 
antibodies recognising the F protein of RSV, PIV1, PIV2, PIV3, MV or MuV, the 

30 G protein of RSV, the HN protein of PIV1 , PIV2, PIV3, MuV, the H protein of 
MV, the RSV virus, the PIV1 virus, the PIV2 virus, the PIV3 virus, the MV virus 
or the MuV virus, or which, when administered to a human, elicits antibodies 

- 10 - 
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recognising the F protein of RSV, PIV1, PIV2, PIV3, MV or MuV, the G protein 
of RSV, the HN protein of PIV1, PIV2, PIV3, MuV, the H protein of MV, the 
RSV virus, the PIV1 virus, the PIV2 virus, the PIV3 virus, the MV virus or the 
s MuV virus. In particular immunogenic derivatives which are slightly longer or 

■> 5 shorter than the heterochimeric proteins of the present invention may be used. Such 
derivatives may, for example, be prepared by substitution, addition, or 
rearrangement of amino acids or by chemical modifications thereof including the 
coupling or for enabling the coupling of the heterochimeric proteins to other carrier 
proteins such as tetanus toxoid or Hepatitis B surface antigen. All such substitutions 

10 and modifications are generally well known to those skilled in the art of peptide 
chemistry. 

Immunogenic fragments of the heterochimeric proteins which may be useful in the 
preparation of vaccines may be prepared by expression of the appropriate gene 
15 fragments or by peptide synthesis, for example using the Merrifield synthesis (The 
Peptides, Vol 2., Academic Press, New York, p3). 

In a further aspect of the invention there is provided recombinant DNA encoding 
the heterochimeric protein of the invention. The recombinant DNA of the invention 
20 may form part of a vector, for example a plasmid, especially an expression plasmid 
from which the heterochimeric protein may be expressed. Such vectors also form 
part of the invention, as do host cells into which the vectors have been introduced. 

In order to construct the DNA encoding a heterochimeric protein according to the 
25 invention, cDNA containing the coding sequences of the RSV, PI VI, PIV2; PIV3, 
MV or MuV fusion and attachment proteins and optionally of the ubiquitin, 
polyhistidine and enterokinase cleavage sites may be manipulated using standard 
techniques [see for example Maniatis T. et al Molecular Cloning, Cold Spring 
Harbor Laboratory, Cold Spring Harbor N. Y. (1982)] as further described 
30 hereinbelow. 
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In another aspect of the invention there is described a process of enhancing the 
protein expression in mammalian cells by optimization of the codon usage of the 
nucleic acids transfected therein. Optimization of the codon usage involves the 
replacement of at least one non-preferred or less preferred codon in a natural gene 
5 encoding a heterochimeric protein by a preferred codon encoding the same amino 
acid. Highly mammalian-expressed genes have C or G at their degenerative 
position (third base in the codon) whereas the RSV or PI V3 -prevalent codons have 
A or T. At least one codon, and more prefereably all the codons of the RSV or 
PIV3 protein can be changed to fit at best the human usage, that is, the one (or 
10 ones) that is the most prevalent as shown below. 



Ala: GCC 


Cys: TGC 


His: CAC 


Met: ATG 


Thr: ACC 


Arg: CGC 
AGG 
CGG 


Gin: CAG 


lie: ATC 


Phe: TTC 


Tip: TGG . 


Asn: AAC 


Glu: GAG 


Leu: CTG 


Pro: CCC 


Tyr: TAC 


Asp: GAC 


Gly: GGC 


Lys: AAG 


Ser: AGC 
TCC 


Val: GTG 



15 Each amino. acid encoded by one of these codons are then considered humanised. 
The ratio between the number of humanised codons versus the total number of 
amino acids gives a percentage of humanisation as shown below. 



20 



1) 


F RSV (]-326)origiiuJ 




140/526 = 


27% 


2) 


r RSV (l-423)humaiuscd 


■ + (424-526)original 


403/526 = 


77% 


3) 


F RSV(M2£)huoiinued 


\ 


489/526 = 


93% 


4) 


F RSV (l-526)origir»l 


+ HN pi v3 (70-372) original 


258/1029 


= 25% 


5) 


P RSV a-526)huminued 


+ HN PIV 3 (70-572) originil 


528/1029 


= 51% 


6) 


F RSV (l-526)buraanucd 


* 

+ HN PrV J (70-372) humuised 




96% 



25 
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The invention also provides DNA encoding a heterochiraeric protein or 
immunogenic derivative thereof in which the codon usage of one or more nucleic 
acids has been substantially optimised and a process for expressing said DNA in a 
CHO or insect cell. 

5 

There have been a number of reports that have described a substantial amelioration 

i 

of protein expression in mammalian cells after re-engineering the nucleic acid 
sequence of the heterologous protein to fit the codon usage found in highly 
expressed human genes (Haas J., Park E-C. and Seed B., Codon usage limitation in 

10 the expression of HiV-1 envelope glycoprotein, Current Biology, 1996, 6, n°3, 
315-325 ; Kim C. H., Oh Y. and Lee T.H., Codon optimization for high-level 
expression of human erythropoietin (EPO) in mammalian cells, Gene 199, 1997,. 
293-301 ; Zolotukhin S., Potter M. Hauswirth W.W. Guy J. and Muzyczka N. A 
Humanized green fluorescent protein cDNA adapted for high level expression in 

15 mammalian cells, J. of Virology, July 1996; 70, n°7, 4646-4654). 

Vectors comprising such DNA, hosts transformed thereby and the truncated or 
hybrid proteins themselves, expressed as described hereinbelow all form part of the 
invention. 

20 

For expression of the proteins of the invention, plasmids may be constructed which 
are suitable either for transfer into vaccinia vims or transfection into CHO cells, 
insect cells or Vero cells. Suitable expression vectors are described hereinbelow. 
Preferably the proteins of the present invention are expressed in CHO or insect 
25 cells. 

For expression in vaccinia a vaccinia transfer plasmid such as pULB 5213 which is 
a derivative of pSCl 1 (Chakrabati et al 9 Molecular and Cellular Biology 5, 3403 - 
3409, 1985) may be used. In one aspect the protein may be expressed under the 
30 control of the vaccinia P7 5 promoter. 
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For expression in CHO-K1 cells a glutamine synthetase (GS) vector such as pEE14 
may suitably be used so that the protein is expressed under the control of the major 
immediate early promoter of human cytomegalovirus (hCMV-MIE). Alternatively 
a vector which allows the expression of the coding module as a polycistronic 
5 transcript with the neo selection gene may suitably be used. In one preferred aspect 
the coding module is under the control of the Rous Sarcoma Long Terminal Repeat 
(LTR) promoter. 



Preferably the plasmid for expression in CHO-K1 cells carries a GS expression 
10 cassette suitable for gene amplification using methionine sulphoximine (MSX). 
Alternatively the plasmid for expression in CHO-K1 cells carries a DHFR 
expression cassette suitable for gene amplification using methotrexate (MTX). 

Preferably expression of the heterochimeric protein of the present invention is 
15 carried out in the presence of sodium butyrate and/or dimethyl sulphoxide (DMSO) 
which may enhance gene expression. 

For expression in insect cells a shuttle vector such as pAcUWSl or pAcGP67 may 
be used: In one aspect the protein may be expressed under the control of the 
20 baculovirus plO promoter or the polyhedrin promoter. 

The expression system may also be a recombinant live microorganism, such as a virus 
or bacterium. The gene of interest can be inserted into the genome of a, live 
recombinant virus or bacterium. Inoculation and in vivo infection with this live vector 

25 will lead to in vivo expression of the antigen and induction of immune responses. 
Viruses and bacteria used for this purpose are for instance: poxviruses (e.g; vaccinia, 
fowlpox, canarypox), alphaviruses (Sindbis virus, Semliki Forest Virus, Venezuelan 
Equine Encephalitis Virus), adenoviruses, adeno-associated virus, picornaviruses 
(poliovirus, rhinovirus), herpesviruses (varicella zoster virus, etc), Listeria, Salmonella, 

30 Shigella, BCG. These viruses and bacteria can be virulent, or attenuated in various 
ways in order to obtain live vaccines. Such live vaccines also form part of the 
invention 
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In yet another aspect of the invention there is provided a vaccine composition 
comprising a heterochimeric protein or immunogenic derivative thereof according to 
the invention in combination with a pharmaceutical^ acceptable carrier, a protein 
5 according to the invention for use in vaccinating a mammal and the use of a protein 
according to the invention in the preparation of a vaccine. 

Optionally, and advantageously, the vaccine of the present invention is combined 
with other immunogens to afford a polyvalent vaccine. In a preferred embodiment 
10 the heterochimeric protein is combined with other subcomponents of RSV, PIV1 , 
PIV2, PIV3, MuV or MV, e.g. the single proteins F, G, HN or H or homochimeric 
proteins such as RSV FxG, PIV3 FxHN or MuV FxHN. 

In a particular aspect the invention further provides a vaccine composition 
15 comprising a protein according to the invention together with a suitable carrier or 
adjuvant. 

Vaccine preparation is generally described in New Trends and Developments in 
Vaccines, edited by Voller et ol % University Park Press, Baltimore, Maryland, 
20 U.S.A., 1978. Encapsulation within liposomes is described, for example by 
Fullerton, U.S. Patent 4,235,877. 

In the vaccine of the present invention , an aqueous solution of the protein(s) can be 
used directly. Alternatively, the protein, with or without prior lyophilisation, can 

25 be mixed, absorbed or adsorbed with any of the various known adjuvants. Such 
adjuvants include, but are not limited to, aluminium hydroxide, muramyl dipeptide 
and saponins such as Quil A. Particularly preferred adjuvants are MPL 
(monophosphoryl lipid A) and 3D-MPL (3 deacylated monophosphoryl lipid A) [US 
patent 4,912,094], optionally formulated with aluminium hudroxide (EP 0 689 454) 

30 or oil in water emulsions (WO 95/17210). A further preferred adjuvant is known as 
QS21 which can be obtained by the method disclosed in US patent 5,057,540. Use 
of 3D-MPL is described by Ribi et al. in Microbiology (1986) Levie et al. feds) 
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Amer. Soc. Microbiol. Wash. D.C., 9-13. Use of Quil A is disclosed by Dalsgaard 
et a/. t (1977), Acta Vet Scand, 18, 349. Use of combined 3D-MPL and QS21 is 
described in WO 94/00153 (SmithKline Beecham Biologicals s.a). QS21 may be 
advantageously formulated with cholesterol containing liposomes, wherein 3D-MPL 
i is present either in solution or incorporated in the membrane, as described in WO 
96/33739. 



As a further exemplary alternative, a heterochimeric protein of the invention or an 
immunogenic fragment thereof can be encapsulated within microparticles such as 

10 liposomes or associated with oil-in- water emulsions. Encapsulation within 
liposomes is described by Fullerton in US patent 4,235,877. In yet another 
exemplary alternative, a heterochimeric protein according to the invention or an 
immunogenic fragment thereof can be conjugated to an immunostimulating 
, macromolecule, such as killed Bordetella or a tetanus toxoid. Conjugation of 

15 proteins to macromolecules is disclosed, for example by Likhite in patent 4,372,945 
and Armor et al. in US patent 4,474,757. 

The amount of the protein of the present invention present in each vaccine dose is 
selected as an amount which induces an immunoprotective response without . 

20 significant, adverse side effects in typical vaccines. Such amount will vary 

depending upon which specific immunogen is employed and whether or not the 
vaccine is adjuvanted. Generally, it is expected that each dose will comprise 
1-lOOOjig of protein, preferably 1-200 ^g. An optimal amount for a particular 
vaccine can be ascertained by standard studies involving observation of antibody 

25 titres and other responses in subjects. 



The following examples and the attached figures (explained below) illustrate the 
invention. 



30 In the Figures: 

Figure 34A shows the impact of humanisation on the level of expression of FrHNp, 
where: 
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FhHNElO = product expressed by the pEE14FhHN transfected clone E10; 

FhHNE7 = product expressed by the pEE14FhHN transfected clone E7; 

FHNbis = product expressed by the pEEl4FHN transfected clone; 

+ but = 2mM Nabutyrate has been added to the cell medium, 3 days before 

harvest; 

pEE14 = negative control; 

Fdroso = pruified Fa- (drosophila derived); the standard protein in this ELISA 
assay wherein lul of standard corresponds to lng of product. 
Figure 34B shows humanisation impact on the level of expression of F^RN^, 
where the level of expression was determined by ELISA. Fdroso = purified Fa- 
(drosophila derived) that is the standard protein in this ELISA assay, lul of standard 
corresponds to lng of product. 
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EXAMPLES 

Example 1 

In order to vaccinate with a single immunogen, heterochimeric DNA molecules 
5 were constructed combining extracellular domains of the F and the attachment 
protein for each virus. DNA constructs for the PIV3 and MuV have already been 
described in WO9306218 and WO9425600, respectively. The DNA molecule, 
combining the extracellular domains of the RSV F and G proteins were constructed 
as described below. 

10 

The DNA pieces were first inserted into the mammalian expression vector based on 
the replicon of the Semliki Forest Virus (pSFVl). This expression system does not 
lead to a stable expression mammalian cell line but, however gives an indication 
whether or not the chimeric protein is expressed and whether the product is 
15 effectively secreted in the culture medium, which is advantageous for the 
purification procedure. 

Stable expression in the culture medium of mammalian cell lines is preferred to 
obtain good quality and quantities of paramyxovirus glycoproteins. All the chimeric 

20 modules have been inserted in the shuttle vector, the pEE14, which integrates in the 
genome of mammalian cells such as CHO-K1. A quite good expression level was 
obtained with the RSV FxG homochimeric recombinant protein, however negligible 
expression was obtained for the FxHN recombinant homochimeric protein of either 
PIV3 or MuV. Expression of heterochimeric proteins was obtained from CHO 

25 cells. 

Thus by constructing heterochimeric DNA molecules combining the extracellular 
domains of the F protein of one virus linked to the extra cellular domain of the HN 
or G protein of another virus and inserting them into the pEE14 vector for CHO 
30 expression it has been possible to raise the expression level of these proteins. These 
proteins may be used to achieve protection against at least two paramyxoviridae 
viruses with a single immunogen. 
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25 



Some of the chimeric molecules have been inserted into the shuttle vectors, 
pAcUW51 and pACGP67, which integrate in the genome of bacterial and 
lepidopteran cells. Surprisingly good expression of heterochimeric proteins was 
obtained from insect cells. 

Vector construction 
Preliminary Constructs 

a) Plasmid pNIV2819 

Starting from plasmid pNIV2801, a cDNA clone encoding inter alia the F protein 
of RSV (type RSS-2; received from Dr Pringle, UK) we reconstructed a cDNA 
module coding for the F protein lacking the membrane anchor sequence. 

1 

Plasmid pNIV2801 was digested with Pstl in order to recover a 1416 bp DNA piece 
encoding amino acid residues 18 to 489 of the F protein. Synthetic 
oligonucleotides, specifying respectively the sequences for amino acids 1 to 17 and 
490 to 526, were used to produce the corresponding cDNA fragments by the 
polymerase chain reaction performed with pNTV2801 DNA as template. The . 
primers were designed to generate also unique flanking restriction sites useful for 
subsequent cloning steps. The coding module was assembled, by ligation, from the 
three DNA pieces described above and introduced into the standard cloning vector 



pUC19, to create plasmid pNIV28 19. This plasmid encodes the RSV F protein 



b) Plasmid pNIY 2820 

The cDNA module encoding the full length F protein of RSV was constructed as 
follows. Using two synthetic oligonucleotides, the polymerase chain reaction was 
performed with pNTV2801 DNA as template to generate a 273 bp DNA fragment 



carrying its signal sequence but lacking its anchor sequence (figure 1). 
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encompassing the sequence coding for aa 490 to aa 574 of the F protein, the stop 
codon and unique restriction sites useful for subsequent cloning steps. This 
fragment was digested with NsH and EcoRl and substituted for the Nsil-EcoKl DNA 
piece present in the coding module of pNTV2819 (figure 2). The resulting plasmid, 
5 pNIV2820, thus encodes the RSV F protein carrying both signal and membrane 
anchor sequences. 

c) Plasmid pNTV2841 

10 In this construction, the DNA coding for aa 165 to 176 of the G protein of RSV is 
fused to the DNA encoding the RSV Fs + a* protein. This part of the G protein is 
conserved among both subgroups of RSV. 

i 

The starting material, pNTV2819 f was digested by Ncol and Smal yielding a 1601 
15 bp fragment. This fragment was subcloned into the Ncol and Mscl sites of 

pNTV103 (a derivative of pULB1221, see European Patent Application No. 186643) 
leading to pNIV2844. This subcloning allowed to place the translation initiation site 
of the F protein in a more favourable context according to the model proposed by 
Kozak (Kozak M, Nature 308, 241-246, 1984). 

20 

* 

A 1605 bp fragment was recovered from pNIV2844 by digestion with Kpril and Sail 
and introduced by ligation into pUC19 digested with Kpnl and Sail, creating 
pNIV2840. 

25 Two complementary synthetic oligonucleotides specifying the sequence for amino 
acids 165 to 176 of the G protein followed by a stop codon and flanked by Nsil. 
BamHi, EcoRl and HindUl sites were hybridized. The 55 bp resulting fragment was 
cloned into the pNIV2840 digested by Nsil and Hindlll, thus replacing a 142 bp . 
DNA sequence encoding amino acids 491 to 526 of the F protein. The resulting 

30 recombinant plasmid, pNIV2841, thus contains the sequence coding for amino acids 
1 to 490 of the F protein followed by amino acids 165 to 176 of the G protein 
(figure 3). 

i 
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Vector Construction 

I) For transfer int the pSFVl vector 

5 a) The RSV fusion protein lacking the membrane anchor domain fused to the 
MuV hemagglutinin-neuraminidase lacking the signal-anchor domain, Frsv (1- 
526) HN^y (60-582). 

Plasmid pNrV2875, a derivative of pNIV2820 which carries the DNA coding for 
10 the F protein of RSV in which the Spel restriction site has been eliminated by site- 
directed mutagenesis into the pUC19 vector, has been digested by Hindlll and 
ArpHI, and a 1618 bp fragment has been isolated. Plasmid pNIV3229, a derivative 
of pNIV3215 whose construction has been already described in WO9425600 and 
which carries the DNA coding for the HN protein of MuV into the pUC19 vector, 
15 has been digested with Bbsl and AzmHI; a 1580 bp fragment has been isolated. 
Both fragments were linked together by two complementary synthetic BspHl-Bbsl 
J oligonucleotides (Fig 4A) restoring the coding sequence of the chimeric molecule 
and were inserted into the BamHl-Hindlll site of the pUC19 vector leading to 
pNIV4102. (Fig4B) After the sequencing of the junction regions, the chimeric 
20 cassette was retrieved from pNIV4102 by a BarriRl digestion and was inserted into 
the Bamm site of the pSFVl vector (Liljestrom, P. and Garoff.H. (1991) . 
Bio/Technology 9, 1356). The resulting plasmid, pNIV4l04, contains into the 
pSFVl vector the sequence coding for amino acids 1 to 526 of the RSV F protein 
followed by amino acids 60 to 582 of the MuV HN protein. (Fig4C) 

25 

b) The RSV fusion protein lacking the membrane anchor domain fused to the 
PIV3 hemagglutinin-neuraminidase lacking the signal-anchor domain, Frsv (1- 
526) HN FIV3 (70-572). 

30 Plasmid pIBI-HN , a cDNA clone containing the complete coding sequence of 
protein HN of PIV3 as well as its 3' non coding sequence (received from Dr.K. 
Dimock, University of Ottawa, Canada), has been digested by Asel and BamUl and 
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a 1468 bp fragment has been isolated. Plasmid pNTV2875 (see supra), which carries 
the DNA encoding the F protein of RSV, in which the unique Spel site has been 
eliminated by site-directed mutagenesis, inserted into the pUC19 vector, has been 
digested by BamHI and BspHl, and a 1588 bp fragment has been isolated. Both 
5 fragments were linked together by two complementary synthetic BspHI-Asel 

oligonucleotides (Fig5A) and were inserted into the BamHI site of the pUC19 vector 
leading to pNIV4105 or to pNIV4109 (FigSB) depending of the orientation of the 
chimeric module in the vector. After the sequencing of the junction region, the 
chimeric cassette was retrieved by a BamHI digestion from pNIV4109 and inserted 
10 into the BamHI site of the pSFVl vector. The resulting plasmid, pNIV41 10, 

contains, inserted into the pSFVl vector, the sequence coding for amino acids 1 to 
526 of the RSV F protein followed by amino acids 70 to 572 of the PIV3 HN 
protein. (FigSC) 

15 c) The PTV3 fusion protein lacking the membrane anchor domain fused to the 
RSV attachment protein lacking the signal-anchor domain, F Prv2 (1-492) G^y 
(69-298). 

Plasmid pNIV3310, described in WO9306218 which carries the DNA coding for 
20 amino acids 1 to 484 of the PIV3 F protein followed by amino acids 87 to 572 of 
the PIV3 HN protein into the pIBI vector, was digested by EcoRI and BgRl, and a 
1435 bp fragment has been isolated. Plasmid pNIV2850, which carries the RSV G 
protein into the pUC19 vector, has been digested by Maelll and HindRl % and a 694 
bp fragment has been isolated. Both fragments were then linked together by using 
25 two complementary Bglil-MaeUl synthetic linkers (Fig6A) and were inserted into 
the EcoW-Hindlll sites of pUC19 vector leading to pNIV4103 (Fig6B). The 
chimeric module was then retrieved from the pUC 19 vector by a BamHl-Hindlll 
digestion. After treating the protruding ends with the Klenow polymerase, the 
chimeric cassette has been inserted into the Smal site of pSFVl vector. The 
30 resulting plasmid pNIV4106, thus contains the sequence coding for amino acids 1 to 
492 of the F protein of PIV3 followed by amino acids 69 to 298 of the G protein of 
x RSV inserted into the pSFV 1 vector (Fig6C) . 
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d) The PIV3 fusion protein lacking the membrane anchor domain linked to the 
MuV hemagglutinin-neuraminidase lacking the signal-anchor domain, F Prv3 (1- 
493) HN MmV (60-582). 

5 

Plasmid pNIV3310 (see supra, FHN PIV3 in pBBI) was digested by EcoM and BgM 
and a 1435 bp fragment was isolated, Plasmid pNIV3229 (see supra, HN MuV into 
pUC19) was digested by Bbsl and Hindlll, and a 1610 bp fragment was isolated. 
Both fragments were linked together by adding two synthetic complementary linkers 

10 specifying a Bgtll and a Bbsl ends (Fig7A) into the pUC19 vector leading to 

pNIV4117 (Fig7B). After sequencing the junction region, the chimeric cassette was 
retrieved from the pUC19 vector by a BamHl digestion and was inserted into the 
BamHl site of the pSFVl vector. The resulting plasmid pNIV4118 encodes, cloned 
in the pSFVl vector, the DNA sequence specifying amino acids 1 to 493* of the 

15 PIV3 fusion protein linked to amino acids 60 to 582 of the MuV HN protein 
(Fig7C). 

e) The MuV fusion protein lacking its membrane anchor domain linked to the 
RSV attachment protein lacking Us signal-anchor domain, F^y (1-482) G^v 
20 (69-298). 

Plasmid pNIV3221, described in WO9425600 which carries the sequence encoding 
amino acids 1 to 462 of the MuV fusion protein within the pUC19 vector, has been 
digested with EcoRI and &rFI, and a 771 bp fragment has been purified. Plasmid 

25 pNIV3221 has been also digested with BsrFI and Pstl, and a 628 bp fragment has- 
been isolated.- Plasmid pNIV2850 (see supra, into the pUC19) has been 
digested with Mae/7/ and Hind/// and a 694 bp fragment has been isolated. The 
three fragments were linked together; the F^y/C^ junction was created by adding 
to the ligation reaction two synthetic complementary oligonucleotide specifying Psil 

30 and Maelll sites (Fig8A), and were inserted into the EcoRI-Hindlll sites of the 
pBluescript vector leading to pNIV4113(Fig8B). The chimeric cassette was 
recovered from pNIV41 13 by a Asp718l digestion and, after treating the protruding 
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ends with the Klenow polymerase, was inserted into the Smd site of the pSFVl 
vector. The resulting plasmid, pNIV4114 contains into the pSFVl vector the 
sequence specifying amino acids 1 to 482 of the MuV F protein linked to amino 
acids 69 to 298 of the RSV G protein (Fig8C). 

5 

f) The MuV fusion protein lacking its membrane anchor domain linked to the 
PIV3 hemagglutinin-neuraminidase lacking its signal-anchor domain, F MuV (1~ 
482) HN pm (54-572). 

10 Plasmid pNIV4113 (see supra, F MuV x G^v in pBluescript) was digested by Bsal and 
BamHl, a 1469 bp fragment was isolated. Plasmid pNTV3308, described in 
WO9306218 and which carries the DNA sequence specifying amino acids 1 to 31 
followed by amino acids 54 to 572 of the PIV3 HN protein into the pIBI vector, 
was digested by EcoRl and BamUl and a 1569 bp fragment was isolated. Both 

15 fragments were linked together by two synthetic complementary linkers specifying 
Bsal and EcoRI sites (Fig9A) into the BaniHl site of pBluescript leading to 
pNIV4115 (Fig9B). The chimeric module was recovered from pNTV4115 by a 
BamHI digestion and was inserted into BamKI site of pSFVl vector. The resulting 
plasmid, pNIV4116, encodes, iii the pSFVl vector, the sequence specifying amino 

20 acids 1-482 of the MuV F protein fused to amino acids 54 to 572 of the PIV3 HN 
protein (Fig9C). 

g) The RSV fusion protein lacking its membrane anchor domain linked to the 
RSV attachment protein lacking its signal-anchor domain, F^y (1-526) Grsv(69- 

25 298). 

Plasmid pNIV2857 (Figl6A), a derivative of pNIV2841 and which contains the 
DNA sequence coding for amino acids 1 to 526 of the RSV fusion protein linked to 
amino acids 69 to 298 of the RSV attachment protein, has been digested by Asp7I8I 
30 and Hindlll and a 2180 bp fragment has been isolated. After treating the protruding 
extremities with Klenow's polymerase, this fragment has been inserted in the Smal 
site of the pSFVl vector. The resulting plasmid pNTV2870, contains in the pSFVl 
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vector, the DNA sequence coding for amino acids 1 to 526 of the RSV fusion 
protein linked to amino acids 69 to 298 of the RSV attachment protein (Figl6B). 

II) For transf ection int CHO cells 

5 * • • 

a) The RSV fusion protein lacking the membrane anchor domain fused to the 

MuV hemagglutinin-neuraminidase lacking the signal-anchor domain, (1- 
526) HN MuV (60-582). 

10 Plasmid pNIV4102 f (FiglOA, see supra, F^y x HN MuV into the pUC19 vector) has 
been digested with BamYLl, and after treating the protruding ends with the Klenow 
polymerase, the chimeric module has been inserted into the Smal site of the 
glutamine synthetase (GS) vector, pEE14 (Cockett et al, 1990, Bio/Technology 8, 
662-667). The resulting plasmid pEE14 Fs + a RSV x HN s"a" MuV contains 

15 sequences coding for amino acids 1 to 526 of the RSV F protein fused to amino 
acids 60 to 582 of the MuV HN protein under the control of the major immediate 
early promoter of the human cytomegalovirus (hCMV-MlE) (FiglOB). 

b) The RSV fusion protein lacking its membrane anchor domain linked to the 
20 PIV3 hemagglutinin-neuraminidase lacking its signal-anchor domain, F^y (1- 

526) HN Prv3 (70-572). 

Plasmids pNIV4105 and pNIV4109 (FigllA and B, see supra, x HN Prv3 into 
the pUC19 vector) were digested by EcoRl and Xhol and a 2032 bp as well as a 
25 1064 bp fragments were isolated. Both fragments were inserted together into the 
EcoW site of pEE14. The resulting plasmid pEE14 Fs + a RSV x HNsa* PIV3 

contains sequences coding for amino acids 1 to 526 of the RSV F protein fused to 

* 

amino acids 70 to 572 of the PIV3 HN protein under the control of the hCMV 

■•> • 

promoter (Fig 11 C). 

30 
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c) The PIV3 fusion protein lacking the membrane anchor region linked to the 
RSV attachment protein lacking the signal-anchor d main, F PTVy (1-492) 
(69-298), 

5 Plasmid pNIV4103 (Figl2A, see supra, x G^v into the pUC19 vector) was 
digested by Hindlll and a 2180 bp fragment was isolated. After treating the 
protruding extremities with the Klenow polymerase, the chimeric module was 
inserted into the Smal site of the pEE14 vector. The resulting plasmid, pEE14 Fs + a 
PIV3 x Gs a'RSV, contains, under the control of the hCMV promoter, the sequence 
10 encoding amino acids 1 to 492 of the PIV3 F protein followed by amino acids 69 to 
298 of the RSV G protein (Fig 12B). 

d) The PIV3 fusion protein lacking the membrane anchor domain fused to the 
MuV hemagglutinin-neuraminidase lacking the signal-anchor domain, (1- 

15 493) HN MuV (60-582). 

Plasmid pN!V4117 (Figl3A, see supra, F^ HN MuV into the pUC19 vector) was 
digested with Hindltl and a 31 19 bp fragment was isolated and inserted into the 
Hindlll site of the pEE14 vector. The resulting plasmid, pEE14 Fs+a PIV3 x HNs 
20 a' MuV, contains under the control of the hCMV promoter a sequence encoding 
amino acids 1 to 493 of the PIV3 fusion protein fused to amino acids 60"to 582 of 
the MuV HN protein (Fig 13B). 

e) The MuV fusion protein lacking its membrane anchor domain fused to the 
25 RSV attachment protein lacking its signal-anchor domain, F^y (1-482) G^ 

(69-298). 

Plasmid pNTV4113 (Figl4A, see supra, F MuV G^v into the pBluescript vector) has 
been digested Asp718l, the protruding ends have been treated by the Klenow 
30 polymerase. A 2200 bp fragment has been isolated and inserted into the Smal site of 
pEE14. The resulting plasmid, pEE14 FsV MuV x Gsa'RSV. has, under the 
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control of the hCMV promoter, the sequence encoding amino acids 1 to 482 of the 
MuV F protein followed by amino acids 69 to 298 of the RSV G protein (FigHB). 

f) The MuV fusion protein lacking its membrane anchor domain fused to the 
5 PIV3 hemagglutinin-neuraminidase lacking its signal-anchor domain, F^y (1- 
482) HN prv3 (54-572). 

Plasmid pNIV4 115 (Figl5A, see supra, F MuV x HN prV3 into the pBIuescript vector) 
has been digested with EcoRl and a 3040 bp fragment has been inserted into the 
10 EcoRI site of the pEE14 vector. The resulting plasmid, pEE14 Fs + a" MuV x HNsa' 
PIV3, contains, downstream to the hCMV promoter region, a sequence coding for 
amino acids 1 to 482 of the MuV F protein followed by amino acids 54 to 572 of 
the PIV3 HN protein (Figl5B). 

15 g) The RSV fusion protein lacking its membrane anchor domain linked to the 
RSV attachment protein lacking its signal-anchor domain, F^y (1-526) 0^(69- 
298). 

Plasmid pNIV2857 (Fig 17 A), a derivative of pNIV2841 and which contains the 
20 DNA sequence coding for amino acids 1 to 526 of the RSV fusion protein linked to 
amino acids 69 to 298 of the RSV attachment protein, has been digested by Asp718I 
and Hindlll and a 2180 bp fragment has been isolated. After treating the protruding 
extremities with Klenow's polymerase, this fragment has been inserted the Smal site 
of the pEE14vector. The resulting plasmid, pEE14 FsVRSV x Gs a RSV, contains 
25 under the control of the hCMV promoter the DNA sequence coding for amino acids 
1 to 526 of the RSV fusion protein linked to amino acids 69 to 298 of the RSV 
attachment protein (Fig 17B). 

h) The original RSV fusion protein lacking the membrane anchor domain 
30 linked to the PIV3 hemagglutin-neuraminidase lacking the signal-anchor 
domain, F wv (1-526) HN Pm (70-572) bis. 
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Plasmid pNIV2852, a derivative of pNTV2820 which carries the DNA encoding the 
RSV F protein where the translation initiation site is in a more favourable context 
according to the model proposed by Kozak (Kozak M, Nature 308, 241-246, 1984), 
has been digested BamHI and BspHI, and a 1588 bp fragment has been isolated. 

Plasmid pIBI-HN, a cDNA clone containing the complete coding sequence of the 
HN protein of PIV3 (received from Dr. K. Dimock, University of Ottawa, Canada) 
has been digested by Asel and BamHI and a 1468 bp has been isolated. 



10 Both fragments were linked together by two complementary synthetic BspHI-Asel 
adaptators (Fig 18 A) and were inserted into the BamHI site of the pUC19 vector 
leading to pNIV4120 (Figl8B). 

4 

After the sequencing of the junction region, the chimeric cassette was retrieved by a 
15 BamHI digestion from pNIV4120 and inserted into the BamHI compatible Bell site 
of the pEE14 vector. The resulting plasmid pEE14 Fs+a'RSV x HNs'a' PP/3 bis 
contains the sequences coding for amino acids 1 to 526 of the RSV F protein fused 
to amino acids 70 to 572 of the PIV3 HN protein under the control of the hCMV 
promoter (Fig 18C). 

20 

This construct differs from the earlier pEE14 Fs + a'RSV x HNs'a' PIV3 construct 
(Il-a) in the F coding region. In FasyHNp,^ bis, the nucleic acid sequence found in 
FrsvHN^vj, ATG GAT CTG (those codons are specifying aa Metl, Asp2 and Leu3) 
and ACC AGT (specifying aa Thr54 and Ser 55) is replaced by the original 
25 sequence of the RSV F protein that is ATG GAG TTG (specifiyng aa Metl , Glu2, 
Leu3) and ACT AGT (specifying Thr54 and Ser55). 

i) The original RSV fusion protein lacking the membrane anchor domain linked 
to the PIV3 hemagglutinin-neuraminidase lacking the signal-anchor domain 
30 with, at the C-terminal part, a polyhistidine tail preceded by the enterokinase 
cleavage site, (1-526) HN Prv3 (70-572) en his 
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Plasmid pIBI-HN, a cDN A containing the PIV3 HN protein coding sequence (see 
supra) has been digested by PstI and Sphl. A 4588 bp fragment has been isolated 
and linked to complementary synthetic Pstl-SphI adaptators (Fig 19 A). 

5 After the sequencing of the junctions as well as the synthetic linkers, the resulting 
plasmid pNTV3340 has been digested by Xhol and BamHI and a 1 121 bp fragment 
has been isolated (Figl9B). 

Plasmid pNTV4120 (see supra) has been digested by Xhol and BamHI and a 2017 bp 
10 fragment has been isolated (Figl9C), 

Both fragments were linked together and inserted into the BamHI compatible Bell 
site of the pEE14 vector. The resulting plasmid pEE14 FRSVs + a x HNs'a" en his 
contains, under the control of the hCMV promoter, sequences coding for amino 
15 acids 1 to 526 of the RSV fusion protein fused to the amino acids 70-572 of the 
PIV3 HN protein fused to the enterokinase cleavage site,({Asp} x4 Lys) followed 
by a polyhistidine tail ({his}x6) and a stop codon (Figl9D). 

20 j) The signal domain of the tissue plasminogen activator fused to the yeast 
ubiquitin followed by the enterokinase cleavage recognition site and the 
original RSV fusion protein lacking Us membrane signal and anchor domains 
linked to the PIV3 hemagglutin-neuraminidase lacking the signal-anchor 
domain, sTPA(l-21) UB(l-74) ent F,^ (24-526) HN PIV3 (70-572)bis. 

25 

1) The signal domain of the tissue plasminogen activator fused to the yeast 
ubiquitin. 

A 208 bp fragment corresponding to amino acid 1 to 76 of the ubiquitin protein of 
30 Saccharomyces cerevisiae was isolated by a digestion of pNIV3475 ( a derivative of 
YEPUBSTUALL, a yeast 2 \i vector backbone carrying the yeast ubiquitin) with 
BamHI and Xbal (Fig 20A). 
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Plasmid JW4304 (received from J. Mullins, University of Washington, U.S. A) 
which encodes the signal domain of the tissue plasminogen activator (sTPA) was 
digested by Nhel and BamHI and a 5115bp was isolated. Both fragments were 
5 linked together using, two synthetic complementary Nhel-Xbal adaptators (Fig20B). 
The resulting plasmid pNTV4121 was digested by Hindlll and BamHI. A 330 bp 
fragment was isolated and inserted into the Hindlll and BamHI sites of the 
pBluescript vector. The resulting plasmid pNIV4122 contains the DNA sequence 
specifying the signal domain of the tissue plasminogen activator followed by an 
10 alanine and a serine residue (those two amino acids are known to produce a good 

leader cleavage) fused to the yeast ubiquitin (Fig 20C). 

. . ■ - - .■(■". 

2) The signal domain of the tissue plasminogen activator linked to the yeast 
ubiquitin followed by the enterokinase cleavage recognition site and amino acid 
15 24 to 55 of the original fusion protein of RSV. 

Plasmid pNIV4122 (Fig 21 A, see supra) was digested by Aflll and SpeL A 3212 bp 
fragment was isolated and linked to synthetic complementary Aflll-Spel adaptators 
(Fig21B). The entire module was then sequenced. The resulting plasmid pNIV4123 
20 encodes the signal domain of the tissue plasminogen activator linked to the N- N 
terminal 74 aa of the yeast ubiquitin followed by the recognition site of enterokinase 
{.(Asp)4 Lys} and amino acid 24 to 55 of the original fusion protein of RSV 
(Fig21C). 

25 3) The signal domain of the tissue plasminogen activator linked to the yeast 
ubiquitin followed by the enterokinase cleavage recognition site and the RSV 
fusion protein linked to the PIV3 hemagglutin-neuraminidase lacking their 
membrane domains. 

30 Plasmid pNIV4123 (Fig 22A, see supra) was digested by Hindlll, treated by the 
Klenow polymerase and digested by SpeL A 408 bp fragment has been isolated. 
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Plasmid pNTV4120 (Fig 22B, see supra) has been digested by Xbal, treated by the 
Kienow polymerase, and digested by SpeL A 5620 bp fragment has been isolated. 



5 



Both fragment have been linked together to generate pNIV4124 (Fig 22C) 



The entire coding module was retrieved from pNIV4124 by a digestion with Xbal 
and EcoRI and was inserted into the Xbal and EcoRI sites of the pEE14 expression 
vector. The resulting plasmid pEE14 sTPA x UBI x EN x Fs a RSV x HNs a PIV3, 
contains, under the control of the hCMV promoter, the sequence coding for aal-21 
10 of the tissue plasminogen activator followed by an alanine and a serine residue, by 
the 74 N-teraiinal amino acids of the yeast ubiqiiitin, by the recognition cleavage 
site of the enterokinase ({Asp}4 Lys), by aa 24-526 of the original RSV fusion 
protein and by aa 70-572 of the hemagglutin-neuraminidase of PIV3 . 

15 III) For transfection into Insect Cells 

4 * 

a) The original RSV fusion protein lacking the membrane anchor domain 
linked to the PiV3 hemagglutin-neuraminidase lacking the signal-anchor 
domain, Fr^v (1-526) HNpyvi (70-572) bis. 

20 • 

Plasmid pNTV4120 (FIG 23A) was digested by BamHI and a 3114 bp fragment was 
isolated and inserted into the BamHI site of the baculovirus transfer vector 
pAcUW51 (PharMingen). The resulting plasmid pNTV4132 (Fig 23B) contains, 
under the control of the polyhedrin promoter, the sequence coding for amino acids 
25 1-526 of the RSV F protein fused to amino acids 70-572 of the PiV3 HN protein. 

>. • ■ . , 

b) The baculovirus gp67 signal peptide fused to the original RSV fusion protein 

lacking both membrane signal and anchor domain linked to the PiV3 
hemagglutin-neuraminidase lacking the signal-anchor domain, sGP67Fjmr (25- 
30 526) HN^ (70-572) bis! 
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Plasmid pNIV4120 (FIG 24A f see supra) was digested by BamHI and Spel and a 
2939 bp fragment was isolated, linked to two complementary synthetic BamHI-Spel 
adaptators and inserted into the BamHI site of the baculovims transfer vector, 
pAcGP67A (PharMingen). The resulting plasmid pNIV4136 (Fig 24) contains, 
5 under the control of the polyhedrin promoter, the sequence coding for amino acids 
1-38 of the Baculovinis gp67 protein, followed by an Alanine and an Aspartate 
linked to amino acids 25-526 of the RSV F protein fused to amino acids 70-572 of 
the PiV3 HN protein. 

10 Expression in eukaryotic cells 

A) via the pSFVl vector 

The pSFVl vector is based on the Semliki Forest Vims (SFV) replicon. The DNA 
15 of interest is cloned into the pSFVl vector that serves as a template for in vitro 

synthesis of recombinant RNA. The RNA is transacted into mammalian cells such 
as BHK-21 cells. The recombinant RNA in the cells drives its own replication and 
capping resulting in production of heterologous protein. 

■\ ■ 

20 Plasmids pNIV2870 was digested with Pvul; pNTV4106, pNIV4110, pNlV4114, 
pNIV41 16 and pNIV4 118 were digested with Spel prior to RNA transcription. 
After a phenol extraction followed by an ethanol precipitation, 2 /ig of linearized 
DNA was used as a template for RNA production. About 5 fig RNA was used to 
transfect. by electroporation, about 8 10 6 BHK-21 cells. All experimental 

25 procedures for RNA production and cell transfection are detailed in Liljestrom and 
Garoff (Bio/Technology, 1991, 9, 1356). 

After 24 h to 48 h post-electroporation, cells and spent culture medium have been 
collected for ELISA and radioimmunoprecipitation assays. 
30 a) pNIV4104, HN MuV 
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ELISA were done using mAb 2072 anti-HN MuV (Orvell, 1984, J. Immunology 
132, 2622-2629) or 20RG45, a goat anti-RSV serum (Fitzgerald, U.S.A.) to coat 
the microti ter plates and a rabbit polyclonal anti-SBL-1 (MuV) serum or mAb 19 
anti-F RSV (G.Taylor, Inst, of Animal Health, Compton Lab., U.K.) as capture 
antibody. 

Radioimmunoprecipitation of the 35 S-methionine labelled product was done using 
mAb2072 (Orvell) and products were resolved onto 7.5% SDS-PAGE. 



10 b) pNIV4110, Frsv HN 



20 



PTV3 



ELISA were done using anti-RSV goat serum 20RG45 or mAb anti-HN^ 4830 
(Rydbeck et al y J, Gen. Virol. 67, 1531-1542, 1986) to coat microliter plates and 
mAbl9 anti-F RSV (G.Taylor) or rabbit anti-PIV3 (E.Norrby, Stockholm) serum as 
15 a capture antibody. 

Radioimmunoprecipitation was done using anti-HN PIV3 mAb4830. 



c) pNIV4106, F PIV3 Gnsv 



ELISA were done using mAb anti-F P1V3 4549 (E.Norrby, Stockholm) or mAb and 
Grsv 858-2 (Chemicon, U.S.A.) to coat microtiter plates and a rabbit anti-PIV3 
serum as a capture antibody. 

25 Radioimmunoprecipitation was done using mAb anti-F Prv3 3283 (Behringwerke). 
d) pNIV41I8, F Prv3 HN MuV 

ELISA plates were coated with anti-F PIV3 mAb 1031215 (Norrby) or with mAb 
30 2072 anti-HN MuV (Orvel) and rabbit anti-PIV3 sera or rabbit anti-MuV sera were 
used as capture antibody. 

Immunoprecipitation of labelled product was done using mAb 2072 anti-HN MuV. 
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e) pNIV4114, F MuV x 

ELISA plates were coated with anti-F MuV monoclonal 5414 (Orvell) or anti G^ v 
5 mAb (Chemicon) and a rabbit anti-SBL-1 serum was used as a capture antibody. 

0 pNTV4116, F MuV x HNpjvs 

ELISA plates were coated with anti-F MuV mAb 5414 (Orvell) or mAb anti-HN 
10 PIV3 4830 (Norrby) and rabbit anti-SBL-1 serum or a rabbit anti-PIV3 serum as a 
capture antibody. 

g) pNTV2870, F RSV x 

15 ELISA were done using 20RG45, a goat anti-RSV serum (Fitzgerald, U.S.A.) to 
coat the microtiter plates and mAbl9 anti-F RSV (G.Taylor, Inst, of Animal 
Health, Compton Lab., U.K.) as capture antibody 



20 B) Expression in CHO cells (stable transfonnants) 



25 



30 



All recombinant plasmids were transfected by calcium phosphate coprecipitation 
into CHO-KI cells, using 20 ^g DNA per 1.25 10 6 cells. The CHO-KI cells were 
grown in GMEM-S medium. The GS transfectants were selected by adding 25 fiM 
methionine sulfoximine to the culture medium two days after transfection. After ten 
to fourteen days, resistant colonies were picked and transferred into 96 wells plates. 
Each transformant was then transferred into 24 wells plates and subsequently to 80 
cm 2 flasks. The GS transformants were assayed for the recombinant products when 

+ 

cells reached about 80% confluency. The procedure follows the one described in 
Cockettff a/ (Bio/Technology, 1990, 5, 662-667). 
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ELIS A and immunoprecipitation of radiolabeled products were done using the same 
procedures as the ones described above for the pSFVl system. 

Results 



5 
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Expression in Insect cells 

10 a) Expression in lepidopteran cells. 

The vector pAcUW5 1 is a shuttle vector for bacteria and lepidopteran cells. A 
heterologous protein coding sequence can be inserted downstream the baculovirus 
plO promoter or either downstream the polyhedrin promoter. 

15 

The pAcGP67 vector is a shuttle vector for bacteria and lepidopteran cells that 
contains the gp67 signal sequence upstream a multiple cloning site. A heterologous 
gene can be inserted in one of the cloning site and will be expressed as a gp67 
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signal peptide fusion protein under the control of the polyhedrin promoter. The 
gp67 signal peptide mediates the secretion of the recombinant protein. 

Either pAcUWSl or pAcGP67 recombinant plasmid can be transfected along with 
5 baculovirus linearised DNA into Sf9 cells (Baculogold DNA, PharMingen). This 
leads to the generation of a recombinant baculovirus stock. The expression of the 
recombinant heterologous protein is obtained by infecting insect cells with the 
recombinant baculovirus 

10 Plasmid pNIV4132 or plasmid pNIV4136 were transfected with baculovirus linearised 
DNA into Sf9 cells. Recombinant baculovirus 3546 (derived from cells transfected by 
pNIV4132) or 5V (derived from cells transfected by pNIV4136) were plaque purified 
and were used to infect Sf9 or High Five™ cells (Invitrogen). 24h to 72 h post- 
infection tKe cells and the spent culture medium have been collected for ELISA and 

15 Western blot analysis. 

ELISA were done using anti-RSV goat serum 20RG45 (Fizgerald) to coat microtiter 
plates and mAbl9 anti-F RSV (G.Taylor) as a capture antibody. 

20 Western blots were done using mAbl9 anti-F RSV (G.Taylor) or using anti-RSV 
goat polyclonal serum 20RG45 (Fizgerald). 

t 

The spent medium from cells infected by either baculovirus 3546 or by 5V tested 
positive in ELISA. The level of expression, depending on the, host cell line (SF9 or 
25 High Five), multiplicity of infection, medium (fetal calf serum supplemented or 
serum free synthetic medium) was at least ten times higher than the one obtained 
with a recombinant CHO-KI clone obtained by transfection with pEE14 F^ (1- 
526) HN PiV3 (70-572)bis . 

30 In addition, the spent medium of the baculovirus infected cells reacted positively in 
Western blot. A band in the vicinity of 1 lOkDa was present in the immunoblots. These 
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results confirm the secretion of the chimeric F RSV -HNp?v3 into the medium of Sf9 and 
High Five cells infected with the recombinant baculoviruses. 



5 



25 



b) Purification of the recombinant product 



SF9 cells, adapted to serum free medium, were infected with the plaque purified 
recombinant baculovirus V5 or 3546. The cells were grown in suspension in 500ml 
Erlenmeyer flask in SF900II medium (Gibco BRL). The medium from virus infected 
cells were harvested two days post-infection. The soluble FR S v-HN P iv3 product was 

10 purified from the medium of infected cells by immunoaffinity chromatography using an 
anti-F RSV monoclonal antibody, mAbl9. The anti-F monoclonal antibody was 
coupled to Activated CH Sepharose 4B (Pharmacia) following the manufacturer 
instructions. The immunoaffinity gel was washed 3 times with 10 bed volumes of 
buffer A (20mM phosphate buffer pH 6.4, NaCl 150mM) prior to sample loading. 

15 After 16 hours at 4°C, the gel was washed with buffer A and the chimeric product was 
eluted with lOOmM phosphoric acid. Eluted protein was neutralized immediately with 
one tenth of volume of 1M phosphate buffer pH 7. 

SDS-PAGE of the immunoaffinity-purified F^-HN^ revealed the presence of a 
20 major protein band of about 110 kDa. This protein was visualized by Coomassie 

blue staining of the gel and reacted with the monoclonal antibody anti-F^y (mAbl9) 
or with the polyclonal serum (20RG45) on immiinoblots (Fig25). 



c) production of polyclonal antibodies 



In order to obtain specific antibodies, the baculovirus derived F R sv-HN Pi v3 protein, 
purified by immunoaffinity as described above, was used to immunise four BalbC mice 
and two New Zealand white rabbits. Three sub-cutaneous injections of 
20jig/ml/dose/rabbit or 6^g/100jil/dose/mouse were done at three weeks interval. The 
30 sera were collected 3 weeks after the second and the third injection and the antibody 
response was detected using ELISA and Western blots assays 
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1) FJ ISA assays 

a) ' Mice response 

The antibody response was followed using a goat anti-RSV serum (20RG45, 
Fitzgerald, USA) to coat the microliter plates and mouse anti-FHN sera as capture 
5 antibody. The antigens used were either the FaRsy-Drosophila or CHO derived, the 
Fpsv-HNpiva expressed in baculovirus and the medium of CHO cells transfected by 
the pEE14 was used as a negative control. 

< 3 our of 4 mice sera collected after the second injection showed some but low 
10 specific response. However, the mice sera collected after the third injection showed 
a high increase in level of specific antibodies. 

b) Rabbit response 

The antibody response was followed using either one of the following ELISA. The 
15 antigens were the same as the one used to detect the mice antibody response. 

Either a goat anti-RSV serum (20RG45, Fitzgerald, USA), either a monoclonal 
antibody directed against the RSV fusion protein (mAbl9, Compton Lab, UK) or a 
monoclonal antobody directed against the PiV3 hemagglutinin-neuraminidase 
(mAb3285 ) Behring) were used to coat the microliter plate and the rabbit anti-HN sera 
20 ; was used as a capture antibody. The first and the second test bleeds generated high 
specific antibodies. 

2) Western blot assays 

Recombinant Fa-RSV CHO-KI ou Drosophila derived, F R sv-HN P iv3 baculovirus 
25 derived or the CHO-pEE 14 spent medium culture were electrophoresed onto a 15% 
SDS-PAGE and transferred onto a nitrocellulose membrane (Amersham). The rabbit 
anti-HN sera as well as the mouse anti-HN sera detected specifically either the F 
protein or the Frsv-HNkvj chimera. 
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Example 2 

i) Optimization of the cod n usage of the nucleic acids sequence coding for the 
RSV fusion pr tein lacking the membrane anchor domain linked to the PiV3 
hemagglutin-neuraminidase lacking the signal-anchor domain, F RSV (1-526) 
5 HNpivs (70-572) for the expression in mammalian cells. 

A table showing the comparison of the codon usage found in the F^yHN^ module 
with the one found in highly expressed human gene can be found in Fig.26. As 
noted, the most prevalent codons found in the FgsvHNpiva module have an A or a T 

10 at their third degenerative position, whereas the human prevalent codons have a C 
or a G. For the improvement of the F^yHNp^ protein expression, the entire coding 
sequence has been re-engineered to fit at best the human codon usage. The re- 
engineered sequence was obtained using synthetic long oligonucleotides, polymerase 
chain reaction (PCR) and conventional cloning procedures. 

15 , . " 

Re-engineering of the coding sequence of the FrsvHNp^ module 
The entire synthetic sequence was recovered by joining three PCR fragments (A, B 
and C). The general strategy to obtain each PCR fragment is schematically 
represented in Fig 27. It consists of assembling overlapping long oligonucleotides in 
20 a first round amplification., The resulting full size fragment is further amplified 
using two short primers located on each of its extremities. 

Construction of fragment A 

The first PCR fragment, corresponding to 18 bases encoding restriction sites 
25 followed by bases 1 to 1269 of the F wv HN PiV3 followed by 8 bases encoding 
restriction sites, was obtained by PCR assembly of 18 overlapping oligonucleotides 
(Fig 28). This fragment has been inserted in the pCRIITOPO cloning vector 
(Invitrogen). After sequencing the fragment, it was retrieved from the pCRIITOPO 
vector by a Xbal and BsrCI digestion and inserted into the corresponding sites of 
pNrV4120. The module corresponding to F^yHNp^ with bases 1 to 1264 
humanized was then retrieved by an Xbal and EcoRI digestion and inserted into the 
corresponding sites of pEE14 (Fig.29) generating pEE14xF KV humHN PiV3 . 

- 39 - 



30 



DEC 04 2000 18=40 



PAGE . 42 




WO 00/18929 




PCT/EP99/07004 



C nstruction of fragment B 

The second PCR fragment B corresponding to 13 bases encoding unique restriction 
sites followed by bases 1264 to 2136 of F^yHNpivs was obtained by assembling 10 
oligonucleotides whose sequences can be found in Fig. 30. This fragment has been 
5 inserted in the pCRIITOPO vector and sequenced. This fragment has been 
recovered by a BsrGI and Kpnl digestion. 

Construction of fragment C 

The third PCR fragment corresponding to bases 2023 to 3090 followed by 6 extra 
10 bases encoding an EcoRI site has been assembled starting from the 15 
oligonucleotides shown in Fig 31. This fragment has been inserted in the 
pCRIITOPO cloning vector and sequenced. This fragment has been retrieved by a 
Kpnl and EcoRI digestion (Fig 31). , 

15 Construction of the entire coding sequence 

The entire F^yHNpiva codon optimized coding sequence has been obtained by 
assembling fragment A, B, C as shown in Fig.32. pNTV4120 in which the PCR 
fragment A has replaced the original sequence (see Fig.29) was digested by BsrGI 
and EcoRI. The original sequence was eliminated and replaced by the BsrGI- Kpnl 

20 fragment B and the KpnI-EcoRI fragment C. The codon optimized module was 

* 

retrieved from the PCRIITOPO vector by a Xbal and an EcoRI and inserted in the 
corresponding sites of the pEE14 vector. The resulting plasmid, pEEKF,^ 
humHNp iV3 hum, encodes for the entire humanized coding sequence. The humanized 
Fj^vHNp^ nucleic acids sequence is shown in Fig. 33. 

25 

Expression in CHO-KI cells 

The recombinant pEE14 F^v humHN PiV3 (see construction of fragment A, above, or 

recombinant pEEMFRsyhurnHNp^hum see construction of the entire coding 

30 sequence, above) was transfected using the FuGene reagent (Boeringer Mannheim), 

using 5 jig DNA per 1.25 10 6 cells. The CHO-KI cells were grown in GMEM-S 

medium. The GS transfectants were selected by adding 25 methionine 

sulfoximine to the culture medium two days after transfection. After ten to fourteen 
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days, resistant colonies were picked and transferred into 96 wells plates. Each 
transfonnant was then transferred into 24 wells plates and subsequently to 80 cm 2 
flasks. The GS transformants were assayed for the recombinant product when cells 
reached about 80% confluency. The procedure follows the one described in Cocketr 
5 el al (Bio/Technology, 1990, 8, 662-667). Alternatively, the expression was 

evaluated three to five days after the addition of sodium butyrate (2mM) in the cell 
culture. 

To compare the expression level to that of the non humanized FrjvHNpjvj, ELISA 
10 assays were done, using 20RG45, a goat anti-RSV serum (Fizgerald, U.S.A.) to 
coat the microtiter plates and mAbl9 anti-F RSV (G. Taylor, Inst, of Animal 
Health, Compton Lab, U.K.) as capture antibody. The expression level was 
estimated using a purified Fa-Rsv expressed in the Drosophila system. 

15 The level of expression of the non-humanized expressed product by 

pEEMFrsvHNkvj didn't exceed 0.03 mg/L and 0. 1 mg/L when sodium butyrate 
was added to the culture medium. The level of expression of the partially 
humanized product expressed by pEEMF^v humHNp iV3 , reached 1 mg/L and up, to 
3 mg/L when sodium butyrate was added in the culture medium. The humanization 

20 of the sequence coding for amino acids 1-423 of the 1029 amino acids thus 
enhanced the level of expression up to 30 fold (see Figure 34a). 



The level of expression of the entirely humanized product expressed by pEEMFrsv 
humHNpj V3 hum was at least of 2 mg/L and reached up to 50 mg/L when sodium 
25 butyrate was added in the culture medium. The humanization of the entire coding — 
region of F^nN^ enhanced the level of expression of at least 200 to 500 
fold (see Figure 34b). 



ii) Optimization of the codon usage of the nucleic acids sequence coding for the 
30 mumps virus (MuV) fusion protein lacking the membrane anchor domain 
linked to the measles virus (MV) lacking the signal-anchor domain, F Min , (1-482) 
H Mv (59-617) for the expression in mammalian cells. 
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A table showing the comparison of the codon usage found in the F M uvHmv module with 
the one found in highly expressed human gene can be found in Fig. 35. As it can be 
seen, the codon usage frequencies of this chimerical gene is quite different from those 
5 . prevalent in the human genome. For the improvement of the FmuvHmv protein 
expression, the entire coding sequence has been re-engineered to fit at best the human 
codon usage. The re-engineered sequence was obtained using synthetic long 
oligonucleotides, polymerase chain reaction (PCR) and conventional cloning 
procedures. 

10 

Re-engineering of the coding sequence of the F Mu vH M v module 

The entire synthetic sequence was recovered by joining four PCR fragments 
(A, B, C and D). The general strategy to obtain each PCR fragment is schematically 
represented in Fig 36. It consists of assembling overlapping long oligonucleotides in a 
15 first round amplification. The resulting full size fragment is further amplified using two 
short primers located on each of its extremities. 

Construction of fragment A 

The first PCR fragment, corresponding to 13 bases specifying restriction sites and a 
20 Kozak consensus motif followed by bases 1 to 1026 of the F^vHwv was obtained by 
PCR assembly of 12 overlapping oligonucleotides (Fig 37). This fragment has been 
inserted in the pCRIITOPO cloning vector (Invitrogen). After sequencing the 
fragment, it was retrieved from the pCRIITOPO vector by a Xbal and TspRI 
digestion and a 963 bp fragment was further purified, leading to fragment A. 

25 

Construction of fragment B 

The second PCR fragment B corresponding to bases 965 to 1712 of F^yH^ was 
obtained by assembling 9 oligonucleotides whose sequences can be found in Fig. 38. 
After its insertion into the pCRIITOPO vector and its sequencing, this 785 bp 
30 fragment has been recovered by a TspRI and Aval digestion. 
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Construed n f fragment C 

The third PCR fragment C corresponding to bases 1712 to 2485 has been assembled 
starting from the 11 oligonucleotides shown in Fig 39. It has been inserted in the 
pCRIITOPO cloning vector and sequenced. This 774 bp fragment has been retrieved 
S by an Aval and Apal digestion. 

Construction of fragment D 

The fourth PCR fragment D corresponding to bases 2485 to 3139 followed by 8 bp 

specifying a unique restriction site has been assembled starting from the 8 

i 

10 oligonucleotides shown in Fig 40. This fragment has been inserted in the 
pCRIITOPO vector and sequenced. A 657 bp fragment has been recovered after an 
Apal and EcoRI digestion. 

Construction of the entire coding sequence 

15 The entire F MuV H Mv cbdon optimised coding sequence has been obtained by 
assembling fragment A, B, C, D and inserting the module digested by Xbal and 
EcoRI into the corresponding sites of the pEE14 vector (Fig. 41). The resulting 
plasmid, pEEMF^vhumH^hum, encodes for a humanised sequence coding for aa 
1-482 of the mumps virus fusion protein followed by aa 59-617 of the measles 

20 virus. The humanised and original F^yH^v nucleic and amino acids sequences are 
shown in Fig. 42. 

iii) Purification and analysis of FHN expressed in CHO-KI 

25 a) Purification 

CHO cell line expressing secreted recombinant FHN was cultivated in cell factories in 
G-MEM medium supplemented with 2% FCS, in presence or absence of 1% Butyrate 
Na. FHN was purified by immunoaffinity chromatography by loading spent culture 
medium onto a Mabl9-sepharose column as described using the same experimental 
30 conditions. 



-43 - 



DEC 04 2000 18=41 



PAGE . 46 




WO 00/18929 PCT/EP99/07004 



When expressed in absence of Butyrate Na, purified FHN migrated on SDS-PAGE, in 
heating and reducing conditions, mainly as a band of 1 10 kDa. In contrast, FHN is 
visualized as a triplet of 1 10, 120 and 130 kDa when CHO cells are cultivated with 
butyrate. Heating has a more drastic effect than reducer on the FHN electrophoretic 
5 migration. Indeed, high molecular weight species are clearly detected in the 

preparation when electrophoresis proceeded without heating suggesting the presence 
of FHN aggregates or oligomers. These aggregates did not seem to be contaminated 
by CHO proteins. Antibodies directed to CHO proteins did not specifically recognize 
on Western blot any bands. Glycan analysis was performed using several lectins 
10 specific for different carbohydrate moieties. Surprisingly, FHN did not carry sialic 

acids or high-mannbse structures but carbohydrates of galactose-acetyl-galactosamine 
type characteristic of hybrid N- and/or O-glycosylations. 

N-terminal micro sequence analysis showed mainly the presence of Fl subunit in bands 
15 of 1 10-130kDa. The F2 N-terminal amino acid sequence detected in bands of lower 
and higher molecular weight indicated that some purified FHN molecules are present 
under a F0 form (non mature F). 

The presence of aggregates or oligomers in the FHN preparations was'confirmed by 
20 gel filtration analysis and proteins were detected by laser-light. scattering. Whatever the 
culture conditions (butyrate or not), between 50 and 65% of FHN populations 
displayed a molecular weight higher than 10 Da demonstrating that FHN is 
aggregated. 5 to 15% has a molecular weight ranging from 400 to 900 kDa whereas 
30 to 35% is monomelic FHN. v 



25 



b) Serum immunoglobin analysis. 



Immunisation protocol 

The F RS vHNp»v3 protein was purified from the spent medium culture of the CHO-KI 

30 cells transfected by the recombinant pEE14 F R svhumHNp,-v3hum by immunoaffinity 

chromatography as described (Purification of the recombinant product expressed in 

baculovirus recombinant infected SF9 cells). The product was injected in 7 groups of 

Balb CI mice as descibed in the following table 1 , 
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Humoral response directed against the FHN protein 

The humoral response directed against the FHN protein was determined. To this end, 
ELIS A plates were coated with immunoaffinity purified FHN protein. 

5 

Total IgG (Fig 43) 

To detect specific anti-FHN total IgG, ELIS A plates were coated with 200ng of 
immunoaffinity purified FHN protein, plates were then saturated and dilutionsof the 
mice second bleed sera were then applied. Total IgG were detected using a biotinylated 
10 serum directed against mouse IgG. 

IgGl (Fig 44) 

To detect specific anti-FHN JgG 1 , ELIS A plates were coated with lOOng of 
immunoaffinity purified FHN protein, plates were then saturated and dilutionsof the 
15 mice second bleed sera were then applied. IgGl were detected using a biotinylated 
serum directed against mouse IgGl. 

IgG2a(Fig45) 

To detect specific anti-FHN IgG2a, ELIS A plates were coated with lOOng of 
20 immunoaffinity purified FHN protein, plates were then saturated and dilutionsof the 
mice second bleed sera were then applied. IgG2a were detected using a biotinylated 
serum directed against mouse IgG2a. 

> 

The titer of each sera was determined and a mean titer for each group was calculated 
25 and is reported in table 2. These experiments show that the FHN antigen by itself or 
formulated with adjuvant (group 1 to 3), stimulates a specific humoral response. 
Indeed, no anti-FHN antibodies are generated in the untreated mice group (group 5) or 
in the group immunised solely with the adjuvant (group 4). The group 1 (and group 4) 
adjuvant was 3D-MPL and QS21 formulated with cholesterol containing liposomes as 
30 described in WO 96/33739; the group 2 adjuvant was alum. • 

The IgGl/IgG2a ratio indicates the Thl or Th2 orientation of the immune response; 

(Table2), a protective response against both the RSy or the PiV3 should tend toward 

-45- 



DEC 04 2000 18=42 



( 



PAGE . 48 




WO 00/18929 * PCT/EP99/07004 

the Thl type, that is a low IgGl/IgG2a ratio. In this regard, the responses generated 
with the FHN formulated in the presence of the 3D-MPL + QS21 adjuvant appears to 
be the more promising one. 

5 Table 1: Experimental procedures 
Immunogenicity FHN in 
mice 



Group 


n 


Vol 
(HO 


route 


Antigen 


Immuno- 
stimulants 


buffer • 


preservative 

4 

J> 


nature 


dose 
(ug) 


1 


12 


2x50 


IM 


FHN 


2 


3D-MP1V 
QS21 


PBS mod 
pH7.4 


thiomersal low 
(lug/ml) 


2 


12 


2x50 


IM 


FHN 

1 

^ 


2 


AI(OH)3 


PBS mod 
pH7.4 


thiomersal low 
(lug/ml) 


3 


12 


2x50 


IM 


FHN 


2 


/ 


PBS mod 
pH7.4 


thiomersal low 
(lfig/ml) 


4 


12 


2x50 


IM 


/ 


/ 


3D-MPL/ 
QS21 


PBS mod 
pH 7 4 


thiomersal low 
(1 ug/ml) 


5 


12 


/ 


/ 


untreated 


/ 


/ 


/ 


/ ' 


6 


12 


2x30 


IN A 


RSV live 




/ 


/ 


/ • 


7 


12 


2x30 


INA 


PIV-3 live 




/ 


1 


/ 



IM=intra-muscular 
INA=intra-nasaI 
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Antigen 


cc. ng/ml 


Buffer 


RS V live 


6.2 

logPFU/ml 




PIV-3 
live 


6.7 

logPFU/ml 




FHN 


120 (2.5ml) 


PBS 
pH7.3 



Time schedule: 
5 Injection 1 = Day 0 
Injection 2 = Day 28 
First Bleed = Day 28 
Second bleed - Day 42 
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Antigen 


cc. ng/ml 


Buffer 


RS V live 


6.2 

logPFU/ml 




PIV-3 
live 


6.7 

logPFU/ml 




FHN 


120 (2.5ml) 


PBS 
pH7.3 



. Time schedule; 
5 Injection 1 = Day 0 
Injection 2 = Day 28 
First Bleed = Day 28 
Second bleed = Day 42 



£ 
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Table 2: Serum antibody response against FHN. 

The total IgG, IgGl and IgG2a was determined for each mouse sera. A mean titer for 
each group was then calculated and is reported in the table. 



group 
n° 


Immunogen 


I Total IgG 


IgGl 


IgG2a 


IeGl/IaG2a 


1 


FHN + 3D- 
MPL/QS21 


1182000 


109800 


305500 


0 36 


2 


FHN + Alum 


1 82200 


127100 


4429 


28 7 


3 


FHN 


44990 


22760 


1941 


1 1 73 


4 


adjuvant=from 

** 

group 1 


49 


32 


ND 


ND 


5 


untreated 


52 


ND 


ND 


ND 


6 


Live RS V 


12840 


748 


2718 


0.27 


7 


Live Pi V3 


10860 


2758 


2320 


1.19 



ND=undetermined, the titer being to low 
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Claims 

1 . A process for preparing a heterochimeric protein or an immunogenic 
derivative thereof comprising an immunogenic fragment of the fusion (F) protein of 
RSV, PIV1, PIV2, PIV3, MV or MuV and an immunogenic fragment of the 

5 attachment (G, HN or H) protein of RSV, PI VI, PIV2, PIV3, MV or MuV which 
process comprises expressing recombinant DNA encoding the heterochimeric 
protein or immunogenic derivative thereof in CHO cells and recovering the protein. 

2. A process according to claim "1 wherein at least one non-preferred or less 

10 preferred codon in a natural gene or DNA encoding the said heterochimeric protein 
or immunogenic fragment thereof has been replaced by a preferred codon encoding 
the same amino. acid. 



3. A heterochimeric protein or an immunogenic derivative thereof comprising an 
15 immunogenic fragment of the fusion (F) protein of RSV, PIV1, PIV2, PIV3, MV 
or MuV and an immunogenic fragment of the attachment (G, HN or H) protein of 
RSV, PIV1, PIV2, PIV3, MV or MuV, with the proviso that where one of the 
immunogenic fragments is derived from RSV F, RSV G or PIV3 F, PIV3 HN, the 
other of the immunogenic fragments is derived from MuV F, MuV HN, MV F, 



20 



MV H, PI VI F,PIV1 HN, PIV2 F or PIV2 HN. 



25 



4. A process for preparing a heterochimeric protein or immunogenic derivative 
thereof as claimed in claim 3 which process comprises expressing recombinant 
DNA encoding the heterochimeric protein or immunogenic derivative thereof in 
either one of; CHO cells or insect cells and recovering the protein. 



5. A protein according to claim 3 wherein the immunogenic fragment of the F 
protein is lacking the membrane anchor domain at its C-terminal end. 



30 



6. A protein according to claims 3 or 5 wherein the immunogenic fragment of the 
G/HN or H protein is lacking the signal/anchor domain at its N-terininal end. 
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7. A protein according to any one of claims 3, 5 or 6 which is linked via an amino 
acid in the C-terminal pan of the immunogenic fragment of the F protein of RSV, 
PIV1, PIV2, PIV3, MV or MuV to an amino acid in the N-terminal part of the 
immunogenic fragment of the G protein of RSV or the HN protein of PIV1 , PIV2, 

5 PIV3, MuV or the H protein of MV. 

8. A protein according to any one of claims 3, 5, 6 or 7 which commences at its N- 
terminal end with a signal sequence from the F protein of RSV, PIV1, PIV2, PIV3, 
MV or MuV. 

10 

9. A protein according to any one of claims 3,5,6 or 7 which commences at its N- 
terminal end with a signal sequence from TPA. 

10. A protein according to any one of claims 3 or 5 to 8 which comprises a 

15 ubiquitiri leader sequence. > 

11. A protein according to any one of claims 3 or 5 to 9 which comprises a 
polyhisridine tail. 

20 12. A protein according to claim 10 or 11 which comprises a cleavage site for 
cleaving off the ubiquitin leader sequence and/or the polyhisridine tail. 

13. A heterochimeric protein according to any one of claims 3 or 5 to 1 1 which is 
selected from the group consisting of: 

25 Fs + aRSVxHNs a'MuV; 

FsV PIV3 x HNs a' MuV; 

Fs + a MuV x Gs a RSV; or - 

Fs + a" MuV x HNsa PIV3, or 

an immunogenic derivative thereof. 

30 

14. A heterochimeric protein according to any one of claims 3 or 5 to 1 1 which is 
selected from the group consisting of: 

i 
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Fs + a' MuV x Hs'a'MV; or 
Fs + aRSVx HNs a PIVl, or 
Fs+a RSVx HNs a*PIV2, or 
an immunogenic derivative thereof. 



10 



15. A heterochimeric protein which is: 

FsV (1-526) RSV x HNsa" (70-572) PIV3, 

Fs + a* (1^*92) PIV3 x Gs a (69-298) RSV, 

Fs + a (1-526) RSV x HNsa (70-572) PIV3 bis, 

Fs + a~ (1-526) RSV x HNs a* (70-572) PIV3 ent his, or 

sTPA (1-21) UB (1-74) ent Fsa (24-526) x HN sa(70-572) PIV3, or 

an immunogenic derivative thereof. 



16. Recombinant DNA encoding a heterochimeric protein or an immunogenic 
15 derivative thereof according to any one of claims 3 or 5 to 15. 

17. Recombinant DNA according to claim 16 in which at least one non-preferred 
or less preferred codon in the DNA has been replaced by a preferred codon 
encoding the same amino acid, 

18. DNA which hybridises under conditions of high stringency with the DNA of 
20 claim 16 or 17. 

! 

19. An expression vector comprising recombinant DNA according to claims 16 to 
18. 

20. A host transformed with DNA according to any one of claims 16 to 18 or with 
a vector according to claim 19, 

25 21. A host according to claim 20 which is a CHO cell. 

22. A host according to claim 21 which is an insect cell. 
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23/ A vaccine composition comprising a protein according to any one of claims 3 
or 5 to 13 or an immunogenic derivative thereof in admixture with a 
pharmaceutically acceptable carrier. 

24. A vaccine composition according to claim 23 farther comprising 3D 
5 Monophosphoryl lipid A and/or QS-21. 

25. A vaccine composition according to claims 23 or 24 wherein the carrier is an 
oil-in- water emulsion. 

26. A heterochimeric protein or an immunogenic derivative thereof according to 
any one of claims 3 or 5 to 15 for use in medicine. 

10 27. A process for the production of a heterochimeric protein' according to any one 
of claims 3 or 5 to 15 which process comprises expressing recombinant DNA 
encoding said protein or immunogenic fragment thereof in a host cell and 
recovering the protein. 

28. A method of treating a human or animal susceptible to paramyxoviridae viral 
15 , infections comprising administering an effective amount of a vaccine according to 

any one of claims 23 to 25. 

29. Use of a protein or an immunogenic derivative thereof according to any one of 
claims 3 or 5 to 15 in the manufacture of a medicament for use in the treatment of 
respiratory disorders. 



\ 
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Fig. 1 
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Fig. 2 
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Fig . 3 
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Fig. 4 



A) Synthetic adaptators 

5' C ATG AAT GAT CAA GGC TTG AGC AA 3' 

TTA CTA GTT CCG AAC TCG TTA GTC 

BspHI BbsI 



[SEQ ID NO: 1] 



B) pNIV4 1 02 



Hindlll BamHI 



BspHI BbsI 

1 1 



BamHI 



Frsv !- 526 



ATG 




HN MuV 60-5 82 \±—p\JC 1 9 



i 



STOP 



C)pNTV4104 



BamHI 



Frsv 1-526 



t 



ATG 



BamHI 



ss 



HN MuV 60-582 



iL_pSFVl 



t 



STOP 
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Fig. 5 



A) Synthetic adaptators 

5' C ATG AAC AAT GAG TTT ATG GAA GTT ACA GAA AAG ATC CAA 

TTG TTA CTC AAA TAC CTT CAA TGT CTT TTC TAG GTT 

BspHI 



ATG GCA TCG GAT ATT AT 3' 
TAC CGT AGC CTA TAA TATA 

Asel 



[SEQ ID NO: 2] 



B)pNIV4109 



BamHI 



Asel BspHI 



BamHI 



STOP 



i 




pUC19 



ATG 



C)pNIV4110 



BamHI 



ATG 



BamHI 




t-pSFVl 



STOP 
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Fig. 6 



A) Synthetic adaptators 

5' GAT CTA GAA GAG TCA AAA GAA TGG ATA AGA AGG TCA AAT CAA 
AT CTT CTC AGT TTT CTT ACC TAT TCT TCC AGT TTA GTT 

Bgin 

AAA CTA GAT TCC ATT GGA AAT TGG CAT CAA TCT AGC ACC 3' 
TTT GAT CTA TGG TAA CCT TTA ACC GTA GTT AGT TCG TGG CAGT G 

Maein 

[SEQIDNO:3] 



B)pNIV4103 



BamHI 



BgUI Maelll 



Hindin 




ii_pUC19 



ATG 



STOP 



C)pNTV4106 



SmaltBamHI 




ATG 



HindlU/Smal 



S_pSFVl 



STOP 
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Fig. 



A) Synthetic adaptators 

5'G ATC TAG AAG AGT CAA AAG AAT GGA TAA GAA GGT CAA ATC 
ATC TTC TCA GTT TTC TTA CCT ATT CTT CCA GTT TAG 

Bglll 

AAA AAC TAG ATT CCA TTG GAA ATT GGC ATC AAT CTA GCA CCA 
TTT TTG ATC TAA GGT AAC CTT TAA CCG TAG TTA GAT CGT GGT 



CAA ATG ATC AAG GCT TGA GCA A 3' 
GTT TAC TAG TTC CGA ACT CGT TAGTC 

Bbsl 



[SEQ ID NO: 4] 



B)pNIV4117 



BamHI 



ATG 



Belli Bbsl 



BamHI 



prv3 



1-493 




HN MuV 60-582 



±_pUC19 



T 



STOP 
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BamHI 











F PIV3 1-493 





ATG 



BamHI 




i_pSFVl 



r 

STOP 
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Fig. 8 



A) Synthetic adaptators . 

5' GAA TGC CGT TAA ATA CAT CAA GAG AGT AAC CAT CAA 

A CGT CTT ACG GCA ATT TAT GTA GTT CTC TCA TTG GTA GTT 

PstI 

CTC CAT CGG TCT GAG TAA GTT CTA AA 3 1 

GAG GTA GCC AGA GTC ATT CAA GAT TTC AGT [SEQ ID NO 

MacIII v 
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A) Synthetic adaptators 



9/73 
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CAAGATTTTTAA 
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Fig. 11 



A)pNTV4105 



11/73 



EcoRI 



! 



f rsv 1 ~ 526 



ATG 



Xhol 

1 




HN PIV3 70-572 



pUG-19 



t 



STOP 



B) pNIV4 1 09 



EcoRI 



Xhol 

1 




r 



STOP 



pUC19 



T 



ATG 



C) P EE14 FsV RSV x HN s'a PrV3 



EcoRI 




Xhol 

1 
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HN PIV3 70-572 



pEE14 



ATG 



STOP 
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A)pNIV4103 



12/73 



Hindlll 



Bglll Maelll 



Hindlll 











F PIV3 1.492 






1 



ATG 



±_pUC19 



I 



STOP 



B) pEE 1 4 Fs + a PIV3 x G sa RS V 



Hindlll 



Hindlll 




r 



ATG 



Grsv 69-298 



pEE14 
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STOP 
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Fig. 13 



A) pNIV4 1 1 7 



Hindlll 



Bglll Bbsl 

1 1 



Hindlll 




t 



ATG 



HN MuV 60-582 



^— pUC19 



STOP 



B) pEE14 FsVPIV3 x UN sYMuV 



Hindlll 



F r ,v3 1-493 



T 



ATG 



Hindlll 




HN MuV 60-582 



ii_pEE,14 



1 



STOP 
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Fig. 14 



A)pNIV4113 



Asp7 1 81 



ATG 



PstI Maelll 



Asp718I 







1 




FmuV 1-482 






Grsv 69-298 



^ — pBluescript 



T 



STOP 



B) pEE14 FsVMuV x G s a" RSV 



Smal/Asp718I 



T 



ATG 



Asp71 S//Smal 











F MuV l-482 






G^ 69-298 



±_pEE14 



1 



STOP 
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A) pNI V4 1 1 5 



15/73 



EcoRI 



Bsal (EcoRI) 



EcoRI 











F MuV 1-482 






HN P1V3 54-572 



▼ pBluescript 



T 



ATG 



STOP 



B) pEE14 FsVMuV x HNs'a" PIV3 



EcoRI 



EcoRI 



1 








F MuV 1-482, 






5 HN PIV3 54-572 



^pEE14 



T 



T 



ATG 



STOP 
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A) pNTV2857 



16/73 



Asp718I 



ATG 



RSV 



1-526 



Hindin 




'RSV 



69-298 



5_pUC19 



STOP 



B) pNIV2870 
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t 
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ATG 



Hindlll/Smal 




Grsv 69-298 



i^pSFVl 



STOP 
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A) pNIV2857 



17/73 



Asp718I 



Frsv 1-526 



i 



ATG 



Hindlll 




Grsv 69-298 



2— pUC19 



STOP 



B) pEE14 FsV-RSY x G sa RSV 
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I 



ATG 
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G RSV 69-298 



±_pEE14 



i 



STOP 
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Fig. 18 

A) Synthetic adaptators 

5' C ATG AAC AAT GAG TTT ATG GAA GTT ACA GAA AAG ATC 

TTG TTA CTC AAA TAC CTT CAA TGT CTT TTC TAG 

BspM 



CAA 
GTT 



ATG GCA TCG GAT ATT AT 3' 
TAC CGT AGC CTA TAA TAT A 

Asel 



[SEQ ID NO: 7] 



B)pNIV4120 
BamHI 



BamHI 



V 




ATG 



l p UC19 



STOP 



C) pEE14 F s + a RSV xHN sa PiV3bis 



BclI/BamHI 











Frsv 1-526 





ATG 



BamHI /Bell 



) 




▼_ P EE14 



I 



STOP 
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Fig. 19 

A) Synthetic adaptators 

PstI 5'GT GAC GAT GAC GAT AAG CAT CAT CAT CAT CAT CAT TAG 
ACGTC ACA CTG CTA CTG CTA TTC GTA GTA GTA GTA GTA GTA ATC 

GGATCCGCATG 3" 

CCTAGGC SphI [SEQ ID NO: 8] 



B) pNIV3340 



Xhol 



PstI 

i 



SphI BamHI 

! i 





HN PIV3 1-572 


Aspx4Lys Htsx6 
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pIBI 



t • 
ATG 



C)pNIV4120 

BamHI 



STOP 



Xhol 



BamHI 




pUC19 



STOP 



C) pEE14 F s + a' RSV xHN sa PiV3 enthis 

BamHI Xhol 
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1 
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1-526 
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Aspx4Lys isx6 ^ 



pEE14 
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STOP 
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A)pNTV3475 

EcoRI Xbal 



Saccharomyces cerevisiae Ubiquitin 



AfUI BamHI 



i_ P UC19 



ATG 



Gly 76 



B) Synthetic adaptators 



5' CT AGC ATG CAG ATC TTC GTC AAG ACG TTA ACC GGT AAA ACC 
Nhel G TAC GTC TAg' AAG CAG TTC TGC AAT TGG CCA TTT TGG 



ATA ACC 3' Xbal 
TAT TGG ATCT 



[SEQ ID NO: 9] 



C)pNTV4122 



Hindlll 



Nhel 



Xbal 



AfUI BamHI 





I 








aa 1-21 sTPA 


A-S 


aa 1 r76 Ubiquitin 










r 



pBluescript 



ATG 



ATG 



Gly76 
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Fig. 21 
A) pNIV4122 



Hindni 



AflllBamHI Spel 





aa 1-21 sTPA 


Ala-Ser 


aa 1-76 Ubiquitin 









pBluescript 



ATG 



Gly76 



B) Synthetic adaptators 

5'TTA AGA CTA AGA GAC GAT GAC GAT AAG TCC AGT CAA AAC 
Aflll CT GAT .TCT CTG CTA CTG CTA TTC AGG TCA GTT TTG 

■ ■/ 

ATC ACT GAA GAA TTT TAT CAA TCA ACA TGC AGT GCA GTC AGC 
TAG TGA CTT CTT AAA ATA GTT AGT TGT ACG TCA CGT CAG TCG 

AAA GGC TAT CTT AGT GCT CTA AGA ACT GGT TGG TAT A3' Spel 
TTT CCG ATA GTT TCT CGA GAT TCT TGA CCA ACC ATA TGA TC 

[SEQ ID NO: 10] 



C)pNIV4123 



Hindlll 



Aflll 



Spel 

1 





sTPA 


a-s 


aa 1-74 
Ubiquitin 


cnteroK 


aa 24-55 Fj^v 











pBluescript 



ATG 
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Fig. 22 



A)pNIV4123 
Hindlll 



Spel 





sTPA 


a | aa 1-74 


entero 


aa 24-55 




s LJbiquitin 


K 


Frsv 











T 



r 



ATG ATG 



ATG 



B)pNTV4120 

Xbal Spel 




pBlue script 



EcoRI 



+ pUC19 



C)pNIV4124 



Xbal 



Spel 

i 



sTPA 



a 1 aa 1 -74 


entero 


aa 24-55 


s Ubiquitin 


K 


Frsv 







F RSV 56-526 



EcoRI 



HN PIV3 70-572 



:L_pUC19 



ATG 



Stop 



D) pEE 1 4 sTPA UBI EN Fsa" RSV x HN PiV3 



Xbal) 



Spel 

i 



EcoRI 



sTPA 
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aa 1-74 


entero 


aa 24-55 




s 


Ubiquitin 


K 


Frsv 











Frsv 56-526 



HNp, V3 70-572 



Z^pEE14 
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ATG 
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Stop 
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A)pNTV4120 



23/73 



BamHI 



ATG 



BamHI 



RSV 



1-526 




HN PIV3 70-572 | 



I 



pUC19 



STOP 



B)pNIV4132 



BamHI 



BamHI 



Polyhedrin 
promoter 



RSV 



1-526 




HNprva 70-572 



p AcUW5 1 
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Fig. 24 



A) pNTV4120 



Spel 



BamHI 



RSV 



1-526 




pUC19 



ATG 



STOP 



B) Synthetic adaptators 

5 'GAT CAA AAC ATC ACT GAA GAA TTT TAT CAA TCA ACA TGC 
BamHI TT TTG TAG TGA CTT CTT AAA ATA GTT AGT TGT ACG 

AGT GCA GTC AGC AAA GGC TAT CTT AGT GCT CTA AGA ACT 
TCA CGT CAG TCG TTT CCG ATA GAA TCA CGA GAT TCT TGA 

GGT TGG TAT A 3 ' Spel 

CCA ACC ATA TGA TC [SEQ ID NO: 11] 



C)pNIV4136 



Bamffl Spel 



BamHI 











Polyhedrin 
promoter 






gp67 
signal 


F^ 25-526 mm HN PIV3 70-572 



P AcUW51 
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Fig 25: SDS-PAGE (reduced conditi ns) of the F RSV HN PiV3 protein purified by 
immunoaffinity from the spent culture medium of the recombinant baculovirus 3546. 

kDa: molecular weight marker K 
A: Coornassie blue staining 

B: Western blot revealed by a goat polyclonal anti-RSV serum 20 RG45 
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Fig , 26: Codon usage of F RSV HN PjVJ and highly expressed human genes (hum high exp) 
showing frequencies (xlOO) of the individual codons for each of the degenerately encoded 
amino acids, and the most prevalent codon in bold. 
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Fig. 27: Schematic diagram of the PCR synthesis of each fragment showing unique 
restriction sites along the sequence (black dots) and restriction sites (A and B) that allow 
retrieval of the full size fragment from the cloning vector. 
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PCR 
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ligation and insertion into the pcrllTOPO 
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Retrieval from the pcrllTOPO vector by restriction with 

enzyme A and B. 
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28: Sequence of the 18 oligonucleotides from which PCR fragment A was generated. 



1) olfhuml - seq, bases 1-90 of F RS vHNpiv3, homologous to mRNA 

5' cccTCTAGAGGATCCACCATGGAGCTGCTGATtttaAAGACCAACGCCATCACCGCCATCCTG 

GCCGCGGTGACCCTCTGCTTCGCGTCC 

2) olfhum2 . seq, bases 75-165 of F R svHN Pi v3r inverse complementary to 



5 ' CCTCAGCGCGCTCAGGTAGCCCTTGCTGACAGCagaGCAGGTGGACTGGTAGAACTCCTCGGTG 
ATGTTCTGGCTGGACGCGAAGCAGAGG 



3) olfhum3. seq, bases 150-240 of FR5vHNp iV 3, homologous to mRNA 

5 ' CCTGAGCGCGCTGAGGACGGGGTGGTACACtAGtGTGATCACCATCGAGCTGAGCAACATCAAG 

GAGAACAAGTGCAACGGCACCGACGCC 

4) olfhum4.seq, bases 225-310 of F RS vHNpi V 3, inverse complementary to 
mARN ^ : 

5'GCATCAGCAGCTGCAGCTCGGTCACGGCGCTCTTGTACTTGTCCAGCTCCTGCTTGATCAGCTT 
CACCTTGGCGTCGGTGCCGTTG 

5) olf hum5. seq, bases 295-397 of F RS vHNpiv3/ homologous to mRNA 

5' CTGCAGCTGCTGATGCAGAGCACCCCCGCCACCAACAACagaGCCAGGCGCGAGCTGCCCAGGT 

TCATGAACTACACCCTCAACAACACCAAGAACACCAACG 

6) olfhumG.seq, bases 378-496 of F RSV HN Pi v3, inverse complementary 
to mRNA 

GGTGCAGGACCTTGGACACCGCGATGCCGCTGGCGATGGCGGAGCCCACGCCCAGCAGGAAGCCCA 
GGAAgCGcctCTTgCgCTTCTTGCTCAGGGTCACGTTGGTGTTCTTGGTGTTG 

7) olfhum7.seq, bases 480-561 of F R svHNpi V3 , homologous to mRNA 

5 ' GTCCAAGGTCCTGCACCTGGAGGGGGAGGTGAACAAGATCAAGAGCGCCCTGCTCTCCACCAAC . 

"AAGGCGGTGGTCAGCCTG 

8) olfhum8.seq, bases 54 3-633 of F RS vHN Pi v3f inverse complementary 
to mARN 

5' GGGGAGCAatTGCTTGTCGATGTAGTTCTTGAGGTCCAGCACCTTGCTGGTCAGCACGCTCACG 
CCGTTGGACAGGCTGACCACCGCCTTG 

9) olfhum9.seq, bases 609-676 of F RS vHNpiv3, homologous to mRNA 

5 ' CTACATGGACAAGCAatTGCTCCCCATCGTGAACAAGCAGt cCTGCAGCATCTCTAACATTGAG 

ACCG 

10) olfhumlO.seq, bases 653-732 of F R svHN Pi v3, inverse complementary 
to mARN 

5 ' GCTGAACTCCCTGGTGATCTCGAGCAGCCTGTTGTTCTTCTGCTGGAACTCGATCACGGTCTCA 
ATGTTAGAGATGCTGC v 



mARN 
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11) olfhumll . seq, bases 714-787 of F R svHN P iv3, homologous to mRNA 

5' GATCACCAGGGAGTTCAGCGTGAACGCgGGcGTcACCACCCCGGTGAGCACCTACATGCTGACC 

* AACAGCGAGC 

12) olfhuml2 . seq, bases 768-846 of F RS vHNpiv3, inverse complementary 
to mARN 

5 ' GTTGGACATaAGCTTCTTCTGGTCGTTGGTGATGGGCATGTCGTTGATCAGGGACAGCAGCTCG 
CTGTTGGTCAGCATG 

13) olfhuml3- seq, bases 825-916 of F RS vHNpiv3/ homologous to 'mRNA 

5 ' CCAGAAGAAGCTtATGTCCAACAACGTGCAGATCGTGCGCCAGCAGAGCTACagCATCATGagC 

ATCATCAAGGAGGAGGTGCTGGCCTACG 

14) olfhuml4 . seq, bases 900-990 of , FRsvHNp iv3 , inverse complementary 
to mARN 

5 ' GGTGGTGCACAGGGGGGAGGTGTGGAGCTTCGAGCAGGGGGTGTCGATCACGCCGTACAGGGGC 
AGCTGCACCACGTAGGCCAGCACCTCC 

' 15) olfhumlS. seq, bases 975-1065 of FRsvHNpi V 3, homologous to mRNA 
5 ' CCCCCTGTGCACCACCAACACCAAGGAGGGCTCCAACATCTGCCTGACCCGCACCGACCGGGGC 

TGGTACTGCGACAACGCCGGCTCCGTG 

16) olfhuml6. seq, bases 1048rll33 of FR£vHN Pi v3, inverse 
complementary to mARN ■ ■ i- 

5 ' CTGTTCATGGTGTCGCAGAACACGCGGTTGGACTGCACCTTGCAGGTCTCCGCCAGGGGGAAGA 
AGGACACGGAGCCGGCGTTGTC t 

17) olfhuml7 . seq, bases 1116-1210 of FrsvHNpiv3 r homologous to mRNA 

5 ' CTGCGACACCATG7VACAGCCTGACCCTGCCCAGCGAGGTG7\ACCTCTGCAACATCGACATCTTC 

AACCCCAAGTACGACTGCAAGATtATGacctcc 

18) olfhuml8 . seq, bases 1195-1295 of FRsyHNpivs, inverse 
complementary to mARN 

gggaattctgtacacttggtcttgccgtagcaggacacgatggcgcccagggaggtgatcacggag 
ctgctcacgtcggtcttggaggtCATaATCTTGCAG 

[SEQUENCES ABOVE ARE SEQ ID NOs: 12 to 29, respectively] 
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Fig. 29: Construction of pEE14 F RSV hiimHNp ;V3 



a) PCR fragment A 



Xbal 



BsrGI 



F RSV hum 1-1264 



ATG 



b) pNIV4120 +PCR fragment A 



BsrGI 
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EcoRJ 
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PiV3 



-pEE14 
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Fig. 30: Sequence of the 10 oligonucleotides from which PCR fragment B was generated. 

1) olf hnhuml9 . seq, bases 1269-1353 of F R svHNp iV 3r homologous to mRNA 
5 1 cggcaagaccaagtgtacagcctccaacaagaaccgcggcatcatcaagaccttctccaacgg 

ctgcgactacgtgtccaacaag 3' 

2) olf hnhum20. seq, bases 1336-1428 of F RS vHN P iv3, inverse 
complementary of mARN 

5' cttcacgtacaggctcttgccctcctgcttgttcacgtagtacagggtgttgcccacggacac 
ggtgtccacgcccttgttggacacgtagtc 3 1 

3) olf hnhum21 . seq, bases 1413-14 97 of FRsvHNpiva, homologous to mARN 
5 1 gagcctgtacgtgaagggcgagcccatcatcaacttctacgacccgctggtgttcccctccga 

cgagtt cgacgcctccatctccc 3' 

4) olf hnhum22 .'seq, basesl4 8 3-1599 of F RS vHNp iV 3, inverse 
complementary of mARN 

5 ' gttcatgatgttggtggtggacttgccggcgttcacgttgtgcagcagctcgtcggacttgcg 
gatgaaggccaggctctggttgatcttctcgttcacctgggagatggaggcgtc 3 1 

5) olfhnhum23 . seq, bases 1581-1691 of F RS vHNp iV 3, homologous to mARN 
5 1 caccaccaacatcatgaacaacgagttcatggaggtgaccgagaagatccagatggcctccga 

caacatcaacgacctgatccagtccggcgtgaacacccggctgctgac 3 1 

6) olfhnhum24 . seq, bases 1677-1779 of F RS vHN Pi v3f inverse, 
complementary of mARN 

5 ? gatggtgatctcgctgatgaacttccgcaggtcggacatctgctgggtcagggagatggggat 
gtagttctgcacgtggctctggatggtcagcagccgggtg 3 1 

7 ) olf hnhum25 . seq, bases 1761-1865 of F RSV HNpiv3f homologous to mRNA , 
5 ' catcagcgagatcaccatccggaacgacaaccaggaggtgcccccccagaggatcacccacga 

cgtgggcataaagcccctgaaccccgacgacttctggcgctg 3 ' 

8 ) olf hnhum26 . seq, bases 1849-1967 of F RS vHN P i V 3, inverse 
complementary of mARN 

5-' gtgcgcacgcagccgtccacggtggtgggcatggccagcaggccgggcccgggcatcagcctt 
atcttgggggtcttcatcagggaggggaggccggaggtgcagcgccagaagtcgtc 3 ' 

9) olfhnhum27 . seq, bases 1953-2059 of F RSV HN P i V 3, homologous to mRNA 
5 1 cggctgcgtgcgcaccccctccctggtgatcaacgacctgatctacgcctacacctccaacct 

gatcacccgcggctgccaggacatcggcaagtcctaccaggtgc 3 1 

10) olf hnhum28 . seq, bases 2043-2154 of FrsvHNpivs, inverse 
complementary of mARN 

5 1 ggacttcctgttgtcgttgatgttgaaggtgtgggagatccgggggttcaggtcgggcaccag 
gtcggagttcacggtgatgatgccgatctgcagcacctggtaggacttg 

[SEQUENCES ABOVE ARE SEQ ID NOs : 30-39, respectively] 
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Fig. 31s Sequence of the 16 oligonucleotides from which PCR fragment C was generated. 

1) olfhnum2 9.seq, bases 2139-2229 of F R svHN Piv3 , homologous to mRNA 

5 ' cgacaacaggaagtcctgctccctggccctcctgaacaccgacgtgtaccagctgtgctccac 

gcccaaggtggacgagcgctccgactac 3' 

2) olfhnhum30. seq, bases 2214-2307 of F RSV HN PiV 3/ inverse 
complementary to mRNA 

5' gttcttgaagcgggtggtggagatggagccgtcgtggttgacgatgtccagcacgatgtcctc 
gatgccggagctggcgtagtcggagcgctcg 3 f 

3) olfhnhum31. seq, bases 2292-2398 of F RS vHN Piv3 , homologous to mRNA 
5' cacccgcttcaagaacaacaacatcagcttcgaccagccctacgccgccctgtacccctccgt 

gggccccggcatctactacaagggcaagatcatcttcctgggc 3' ^ 

4 ) olfhnhum32. seq, bases 2382-2472 of F RSV HN Pi v3, inverse 
complementary . to mRNA 

5' ccgctgggtcttgccggggcacccggtggtgttgcagatggcgttctcgttgatggggtgctc 
caggccgccgtagcccaggaagatgatc 3' 

5) olf hnhum33 . seq, bases 2457-2549 of FR£ V HNpiv3 r homologous to mRNA 
5 1 cggcaagacccagcgggactgcaaccaggcctcccacagcccctggttctccgaccgccgcat 

ggtgaactccatcatcgtggtggacaaggg 3' 

6) olfhnhum34 . seq, bases 2532-2643 of F RS vHN Pi v3r inverse ' 
complementary to mRNA 

5' cttgttgcccagcagcagcaggcggccctcggagccccagtagttctgccgcatggagatggt 
ccacaccttcagcttggggatggagttcaggcccttgtccaccacgatg 3 1 

7) olfhnhum35. seq, bases 2628-2726 of . F RS vHN PiV 3, ' homologous to mRNA 
5 'gctgctgggcaacaagatctacatctacacccgctccaccagctggcacagcaagctgcagct 

gggcatcatcgacatcaccgactacagcgacatccg 3 ' • 

8) olf hnhum36. seq, bases 2710-2781 of FRsvHN PiV3/ inverse 
complementary to mRNA 

5 7 ggggcactcgttgttgccgggccggctcagcacgttgtgccaggtccacttgatgcggatgtc 
gctgtagtc 3' 

9) olfhnhum37.seq, bases 2765-2836 of FrsvHN P iv3 r homologous to mRNA 
5' gcaacaacgagtgcccctggggccactcctgccccgacggctgcatcaccggcgtgtacaccg 

acgcctacc 3 ' 

10) olfhnhum38 . seq, bases 2820-2889 of F RS vHN PiV 3, inverse 
complementary to mRNA 

5 ' cttctgggagtccaggatcacggagctcacgatgctgccggtggggttcagggggtaggcgtc 
ggtgtac 
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11) olf hnhum39 . seq, bases 2874-2943 of F R svHN P iv3/ homologous to mRNA 
5 ' cctggactcccagaagtcccgggtgaaccccgtgatcacctacagcacctccaccgagcgcgt 

gaacgag 

12) olfhnhum40. seq, bases 2927-2994 of F R svHN Pi v3f rom: 1 to: 68, 
inverse complementary to mRNA 

5' gcagctggtggtggtgtagccggcgctcagggtcttgttgcggatggccagctcgttcacgcg 
ctcgg ( 

13) olfhnhum41. seq, bases 2979-3043 of F RS vHNp iV 3, homologous to mRNA 
5' caccaccaccagctgcatcacccactacaacaagggctactgcttccacatcgtggagatcaa 

cc 

14) olfhnhum42. seq, bases 3027-3085 of ,F R svHN P iv3, inverse 
complementary to mRNA 

5 ■ cggtcttgaacagcatgggctggaaggtgtccaggctcttgtggttgatctccacgatg 3 1 

15) olfhnhum43 . seq, bases 3069-3114 of F RS vHNp iV 3, homologous to mRNA 
5' catgctgttcaagaccgagatccccaagagctgcagctaaGAATTC 3 1 

/ . • 

[SEQUENCES ABOVE ARE SEQ ID NOs : 40-54, respectively] 
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Fig. 32 : Construction of pEE14F RSV hum HN rjV3 hum 
a) pNI V4 120 +PCR fragment A 

BsrGI 

XbaJ 



EcoRl 



* RSV 

!265-1578XHN PiV J pUC19 



b) PCR fragment B 



BsrGI 



Kpnl 



Frsv HN PiV3 1 264-2 1 3 6hum 



c) PCR fragment C 



Kpnl 



EcoRI 



F RS V HN PiV 32l36- 
3090hum 



d) pEE14 F RSV hum HN PiV3 hum 



Xbal 



EcoRI 



F RSV humHN PlV3 h\im 



pEE14 
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Fig * 33A : Humanized nucleic acids sequence of F KSV HNp iV3 (upper sequ ncc) compared to 
the original sequence found in the pNIV4120. 

7 AGAGGATCC. ..... . ACCATGGAGCTGCTGATtttaAAGACCAACGCCA 49 

MINIMI I I I I I I I I I I I II I ! I , I I I I I I I I I 

2262 AGAGGATCCCCCGGGTAccatggagttgctaatcctcaaaacaaatgcaa 2311 

50 TCACCGCCATCCTGGCCGCGGTGACCCTCTGCTTCGCGTCCAGCCAGAAC 99 

I M I J I .MM I M II II M I I I I I II M M I M M Ml 
2312 ttaccgcaatccttgctgcagtcacactctgttttgcttccagtcaaaac 2361 

100 ATCACCGAGGAGTTCTACCAGTCCACCTGCtctGCTGTCAGCAAGGGCTA 14 9 

I I I I | III II M II II II Ml Ml I I I I I I I I I I M 
2362 atcactgaagaattttatcaatcaacatgcagtgcagtcagcaaaggcta 2411 

150 CCTGAGCGCGCTGAGGACGGGGTGGTACACtAGtGTGATCACCATCGAGC 199 

II II M II II II II Mill I I I I I I M II M M II 
2 412 tcttagtgctctaagaactggttggtatactagtgttataactatagaat 2 4 61 

200 TGAGCAACATCAAGGAGAACAAGTGCAACGGCACCGACGCCAAGGTGAAG 24 9 

. I II II II II II M III II I I II M II II M I I I I I I II 
24 62 taagtaatatcaaggaaaataagtgtaatggaacagacgctaaggtaaaa 2511 

250 .CTGATCAAGCAGGAGCTGGACAAGTACAAGAGGGCCGTGACCGAGCTGCA 299 

I I I I II II II I I I I I II M II I I II II I I I I I I 
2512 ttgataaaacaagaattagataaatataaaagtgctgtaacagaattgca 2561 

. . • • • . 

'3 00 GCTGCTGATGCAGAGCACCCCCGCCACCAACAACagaGCCAGGCGCGAGC 34 9 

I MM Mill I I I I I II II I I I I I I I I I I I I I I I I I I I 
2562 gttgctcatgcaaagcacaccggcaaccaacaatcgagccagaagagaac 2611 

350 TGCCCAGGTTCATG7UVCTACACCCTCAACAACACCAAGAACACCAACGTG 399 c 

I 11 II II I II I II II M I I I I I I I I I II I I II Mill II 
2612 taccaaggtttatgaattatacactcaacaataccaaaaataccaatgta 2661 

4 00 ACCGTGAGCAAGAAGcGcAAGaggCGcTTCCTGGGCTTCCTGCTGGGCGT 44 9 

M III) II III I II II I II I! HIM II I II II 
2 662 acattaagcaagaaaaggaaaagaagatttcttggctttttgttaggtgt 2711 

4 50 GGGCTCCGCCATCGCCAGCGGCATCGCGGTGTCCAAGGTCCTGCACCTGG 4 99 

M MM I I I I I I II I I I I I II II II II M M I I II I I M I 
2712 tggatctgcaatcgccagtggcattgctgtatctaaggtcctgcacctag 2761 

^ . . ... * 

500 AGGGGGAGGTGAACAAGATCAAGAGCGCCCTGCTCTCCACCAACAAGGCG 54 9 

I II I I I I I I M I I I I I I I I till II II Mill I I II I I I I 
27 62 aaggggaagtgaacaaaatcaaaagtgctctactatccacaaacaaggct 2 811 
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GTGGTCAGCCTGTCCAACGGCGTGAGCGTGCTGACCAGCAAGGTGCTGGA 599 

|| | | | | || ' I II II II II II II I I I I I I I I I III I II 
gtagtcagcttatcaaatggagttagtgtcttaaccagcaaagtgttaga 28 61 



CCTCAAGAACTACATCGACAAGCAatTGCTCCCCATCGTGAACAAGCAGt 

| | | | | I Mill II 11.11 II I I I I II I I I I I I I I I I I I I 
cctcaaaaactatatagataaacagttgttacctattgtgaacaagcaaa 



649 



2911 



699 



CCTGCAGCATCTCTAACATTGAGACCGTGATCGAGTTCCAGCAGAAGAAC 

Ml | | I II II I I I I I I I I •! I I I I I I I I I IN I I II I I I I I I 
gctgtagcatatcaaacattgaaactgtgatagagttccaacaaaagaac 29 61 



AACAGGCTGCTGGAGATCACCAGGGAGTTCAGCGTGAACGCgGGcGTcAC 

| | I I I II II II II I I I I I I I I I I I I I I I II II II II II 
aacagactactagagattaccagggaatttagtgttaatgcaggtgtaac 



749 



3011 



799 



CACCCCGGTGAGCACCTACATGCTGACCAACAGCGAGCTGCTGTCCCTGA 

II || || II I I I Kl Ml I N II II I' I I II' U 

tacacctgtaagcacttatatgttaacaaatagtgaattattatcattaa 30 61 



TCAACGACATGCCCATCACCAACGACCAGAAGAAGCTtATGTCCAACAAC 

| | | | | |. | | | I I II I I I I II II III I I I I I I I I I I I I I I I 
tcaatgatatgcGtataacaaatgatcagaaaaagttaatgtccaacaat 



849 



3111 



GTGCAGATCGTGCGCCAGCAGAGCTACagCATCATGagCATCATCAAGGA 899 

II || | | I I I I I I I I II Ml II I II I Mi ll I I I I I 
gttcaaatagttagacagcaaagttactctatcatgtccataataaagga 



3161 



900 GGAGGTGCTGGCCTACGTGGTGCAGCTGCCCCTGTACGGCGTGATCGACA 94 9 

Mi ll I II I I I I II II I M M II II M II M l 
3162 ggaagtcttagcatatgtagtacaattaccactatatggtgtaatagata 3211 

• • • 

950 CCCCCTGCTGGAAGCTGCACACCTCCCCCCTGTGCACCACCAACACCAAG 9 99 

I M || I I I II I I I I I I I I I 1 I I I M II M MINIM IM 
3212 caccttgttggaaactgcacacatcccctctatgtacaaccaacacaaag 32 61 



100 0 GAGGGCTCCAACATCTGCCTGACCCGCACCGACCGGGGCTGGTACTGCGA 

I I ! I 1 I M M I M I I hil l II I I I I I II MINIM II 
32 62 gaagggtccaacatctgtttaacaagaaccgacagaggatggtactgtga 



1049 



3311 



1050 C7\ACGCCGGCTCCGTGTCCTTCTTCCCCCTGGCGGAGACCTGCAAGGTGC 1099 

III II II II II I I I I I M I U II N il II I I II II I 
3312 caatgcaggatcagtatctttcttcccactagctgaaacatgtaaagttc 3361 

1100 AGTCCAACCGCGTGTTCTGCGACACCATGAACAGCCTGACCCTGCCCAGC 114 9 
I I I I I I I I I I I I I I N I I II II I II I III I M II 
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3362 aa'tcgaatcgagtattttgtgacacaatgaacagtttaacattaccaagt 3411 



1150 
3412 
1200 
3462 
1250 
3512 
1300 
3562 
1350 
3612 
1400 
3662 
1450 
3712 
1500 
3762 
1550 
3812 



GAGGTGAACCTCTGCAACATCGACATCTTCAACCCCAAGTACGACTGCAA 1199 

|| || || | I I I I I I I I I I Mill I I I I I I I I I I I M II Mill 
gaagtaaatctctgcaacattgacatattcaaccccaaatatgattgcaa 34 61 

GATtATGacctccaagaccgacgtgagcagctccgtgatcacctccctgg 124 9 

| | I I I I I I II II M M II I I I I I I I I I I I I I I I I M I I I 
aattatgacttcaaaaacagatg'taagcagctccgttatcacatctctag 3511 



gcgccatcgtgtcctgctacggcaagaccaagtgtacagcctccaacaag 

| Mill I I I I I I I I I I I M M II I I I I II II II I I J I I I I 
gagccattgtgtcatgctatggcaaaactaaatgtacagcatccaataaa 

aaccgcggcatcatcaagaccttctccaacggctgcgactacgtgtccaa 

M I I I I II I I i I I M I I I I I I I II I M I I I I I I I I I I 
aatcgtggaatcataaagacattttctaacgggtgtgattatgtatcaaa 

caagggcgtggacaccgtgtccgtgggcaacaccctgtactacgtgaaca 

| | I I I I I I I I I I I I I I I I I I I I M III II I I I' I' I 
taagggggtggacactgtgtctgtaggtaatacattatattatgtaaata 

agcaggagggcaagagcctgtacgtgaagggcgag'cccatcatcaacttc 

| I II II I I I I I II M II M M LI M II II II M -Ml 
agcaagaaggcaaaagtctctatgtaaaaggtgaaccaataataaatttc 

tacgacccgctggtgttcccctccgacgagttcgacgcctccatctccca 

I I I II II I I I II M I I I I I I I i I II II II II II I.I M 

tatgacccattagtgttcccctctgatgaatttgatgcatcaatatctca 

ggtgaacgagaagatcaaccagagcctggccttcatccgcaagtccgacg 

II || I I I I II I I I I I I! I I I I II II III! II M MIM I 
agtcaatgagaagattaaccagagcctagcatttattcgtaaatccgatg 

- 

agctgetgcacaacgtgaacgccggcaagtccaccaccaacatcatgaac 
I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I M 
aattattacataatgtaaatgctggtaaatccaccacaaatatcatgAAC 



1299 
3561 
1349 
3611 
1399 
3661 
1449 
3711 
1499 
3761 
1549 
3811 
1599 
38 61 



1600 aacgagttcatggaggtgaccgagaagatccagatggcctccgacaacat 164 9 

II I I I I I I I I I I II II M I I I I I II I I I I I I M M M II 
38 62 AATGAGTTTATGGAAGTTACAGAAAAGATCCAAATGGCATCGGATAATAT 3 911 

1650 caacgacctgatccagtccggcgtgaacacccggctgctgaccatccaga 1699 

II || || II I I I I I II II I II II III I I I M M MM 
3912 TAATGATCTAATACAGTCAGGAGTGAATACAAGGCTTCTTACAATTCAGA 3 9 61 
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1700 gccacgtgcagaactacatccccatctccctgacccagcagatgtccgac 1749 

| || || Mill II II II I I I I I I II II M I I I II II 

3 962 GTCATGTCCAGAATTATATACCaATATCATTGACACAACAAATGTCGGAT 4 011 

■ * . * * 

1750 ctgcggaagttcatcagcgagatcaccatccggaacgacaaccaggaggt 17 99 

M Ml II II II II M I I I I I I I I I I I I I I 

4 012 CTTAGGAAATTCATTAGTGAAATTACAATTAGGAATGATAATC7VAGAAGT 4 0 61 

1800 gcccccccagaggatcacccacgacgtgggcataaagcccctgaaccccg 1849 

| | | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
4 0 62 GCCTCCACAAAGAATAACACATGATGTGGGCATAAAACCTTTAAATCCAG 4111 

1850 acgacttctggcgctgcacctccggcctcccctccctgatgaagaccccc 18 99 

I II II III I I I I I I I I I I II M M I I I I I.I II M 
4112 ATGATTTTTGGAGATGCACGTCTGGTCTTCCATCTTTAATGAAAACTCCA 4161 

1900 aagataaggctgatgcccgggcccggcctgctggccatgcccaccaccgt 194 9 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 

4162 AAAATAAGGTTAATGCCGGGGCCGGGATTATTAGCTATGCCAACGACTGT 4 211 



1950 ggacggctgcgtg'cgcaccccctccctggtgatcaacgacctgatctacg 1999 

I I I I I I I I I I II II III I II II II II Mill II I 
4 212 TGATGGCTGTGTTAGAACTCCGTCCTTAGTTATAAATGATCTGATTTATG 42 61 

2000 cctacacctccaacctgatcacccgcggctgccaggacatcggcaagtcc 2 04 9 

III I I I I I II III II II II I M I I I I I I I II II II 
4 262 CTTATACCTCGAATCT7VATTACTCGAGGTTGCCAGGATATAGGAAAATCA 4 311 

2 050 taccaggtgctgcagatcggcatcatcaccgtgaactccgacctggtacc 2 0 99 

I I I I II I I I I I I II I I II I I II I I M I III I I I I I M 
4 312 TATCAAGTATTACAGATAGGGATAATAACTGTAAACTCAGACTTGGTACC 4 361 

a • • * • • 

2100 cgacctgaacccccggatctcccacaccttcaacatcaacgacaacagga 214 9 

Ml I II II I I I I I I I II II I I I I I I I I I I I I I I I II I 
4 362 TGACTTAAATCCTAGGATCTCTCATACTTTCAACATAAATGACAATAGAA 4 411 

* 

2150 agtcctgctccctggccctcctgaacaccgacgtgtaccagctgtgctcc 2199 

I I I I II II I I M I I I I I I I I I I II II II M I I I I I I I 
4412 AGTCATGTTCTCTAGCACTCCTAAACACAGATGTATATCT^CTGTGTTCG 4 4 61 

■ 

2 200 acgcccaaggtggacgagcgctccgactacgccagctccggcatcgagga 224 9 

1111111111111 I I I I I I I I I I I I I I I I II M 

4 4 62 ACTCCCAAAGTTGATGAAAGATCAGATTATGCATCATCAGGCATAGAAGA 4 511 



2250 catcgtgctggacatcgtcaaccacgacggctccatctccaccacccgct 22 99 

II II II II II I I I I I M II II II Mill II II I I 
4 512 TATTGTACTTGATATtGTCAATCATGATGGTTC7\ATCTCAACAACAAGAT 4 561 
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2300 tcaagaacaacaacatcagcttcgaccagccctacgccgccctgtacccc 234 9 

I I | I I I I I I II II II II* II II I I I I I I II I I I I I I 
4 5 62 TTAAGAACAATAATATAAGTTTTGATCAACCATATGCGGCATTATACCCA 4 611 

23 50 tccgtgggccccggcatctactacaagggcaagatcatcttcctgggcta 2399 

|| || || II II II I I I I I I I I I I I I I II II II II II II 
4 612 TCTGTTGGACCAGGGATATACTACAAAGGCATVlAATAATATTTCTCGGGTA 4 661 

. - * . - 

2 4 00 cggcggccf ggagcaccccatcaacgagaacgccatctgcaacaccaccg 24 4 9 

|| || II M II II II M I I I I I II I I I I I I I I I I I II I 
4 662 TGGAGGTCTTGAACATCCAATAAATGAG/^ATGC/VATCTGCAACACAACTG 4711 

2 4 50 ggtgccccggcaagacccagcgggactgcaaccaggcctcccacagcccc 24 99 

MM I I I I I II M III I I II I I I I I II I I I II II II M 
4 712 GGTGTCCCGGGAAAACGCAGAGAGACTGCAATCAGGCATCTCATAGTCCT 4 7 61 

• • » » * 

2500 tggttctccgaccgccgcatggtgaactccatcatcgtggtggacaaggg 254 9 

M II I II III I I I I I I I II I I M I I II II II I I I I I I I I 
4 7 62 TGGTTTTCAGACAGAAGGATGGTCAACTCCATTATTGTTGTTGACAAGGG 4 811 

a, ■ ■ ' • • 

2 550 cctgaactccatccccaagctgaaggtgtggaccatctccatgcggpaga 2599 

I | | | | I I II II II I I I I II I I MIM II I I I I I I I II I 
4 812 CTTAAACTCAATTCCAAAACTGAAGGTATGGACGATATCCATGAGACAAA 4 8 61 

'2 600 actactggggctccgagggccgcctgctgctgctgggcaacaagatctac 2 64 9 

I M III I I I I I II II I II II I I II II I I II I I M I I I 

4 8 62 ATTACTGGGGGTCAGAAGGAAGGCTACTTCTACTAGGTAACAAGATCTAT 4 911 

2 6 50 atctacacccgctccaccagctggcacagcaagctgcaactgggcatcat 2699 

II II I I I II M I I I II I I I I I I II I I I I II II II 

4 912 ATATATACT^AGATCTACAAGTTGGCATAGCAAGTTACT^ATTAGGAATAAT 4 961 

- • • • 

2700 cgacatcaccgactacagcgacatccgcatcaagtggacctggcacaacg 274 9 

II II II II I I I I I I I II I II II II I I I I II I I II I 
4 9 62 TGATATTACTGATTACAGTGATATAAGAATAAAATGGACATGGCATAA TG 5011 

• • • • ^. ■ 

2750 tgctgagccggcccggcaacaacgagtgcccctggggccactcctgcccc 2799 

MM I II II I M I I I I II II I I Ml I 1 IT I I I I I 

5012 TGCTATCAAGACCAGGAAACAATGAATGTCCATGGGGACATTCATGCCCA 50 61 

■ • • • * 

. 2800 gacggctgcatcaccggcgtgtacaccgacgcctaccccctgaaccccac 2849 

II II II II II M II II II M I I II II I I I I I I I I I 
5062 GATGGATGTATAACAGGAGTATATACTGATGCATATCCACTCAATCCCAC 5111 

• • • • * 

2*8 50 cggcagcatcgtgagctccgtgatcctggactcccagaagtcccgggtga 28 99 

II I I I I I I I I I I I I I I I I I I II II II I I I I I 1 
5112 AGGGAGCATTGTGTCATCTGTCATATTAGACTCGCAAAAATCGAGAGTAA 5161 
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2900 accccgtgatcacctacagcacctccaccgagcgcgtgaacgagctggcc 2949 

MM II II II IN II II II II I I I II I I I I I I I I I I 
5162 ACCCAGTCATAACTTACTCAACATCAACTGAAAGGGTAAACGAGCTGGCC 5211 

• • • • 

2950 atccgcaacaagaccctgagcgccggctacaccaccaccagctgcatcac 2999 

I I I I I I I I I I II II II II M I I I I I I I II I M I I II 
5212 ATCCGAAACAAAACACTCTCAGCTGGATATACAACAACGAGCTGCATTAC 52 61 

3000 ccactacaacaagggctactgcttccacatcgtggagatcaaccacaaga 304 9 

I I I I I I I I I I I I I I I I M I I I I I I I I I I Kl I I I I I - 
52 62 ACACTATAACAAAGGATATTGTTTTCATATAGTAGT^TAAATCATAAAA 5311 

3050 gcctggacaccttccagcccatgctgttcaagaccgagatccccaagagc 3099 

II 1111,1 I I III M ! M II II I I I I I II I I I M 

5312 GCTTAGACACATTCCAACCTATGTTGTTCAAMCAGAGAtTCCAAAAAGC 5361 

3100 tgcagctaaGAAT 3112 

I I I I I III II 
5362 TGCAGTTAATCAT ■ 5 374 
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Fig. 33B 

(Linear) MAP of: FrsvHNpiv3 . seq check: 7448 from: 1 to: 
3090 nucleic acids sequence of FrsvHNpiv3 (non humanised) 

atggagttgctaatcctcaaaacaaatgcaattaccgcaatccttgctgc 

agtcacactctgttttgcttccagtcaaaacatcactgaagaattttatc 

aatcaacatgcagtgcagtcagcaaaggctatcttagtgctctaagaact 

ggttggtatactagtgttataactatagaattaagtaatatcaaggaaaa 

taagtgtaatggaacagacgctaaggtaaaattgataaaacaagaattag 

ataaatataaaagtgctgtaacagaattgcagttgctcatgcaaagcaca 

ccggcaaccaacaatcgagccagaagagaactaccaaggtttatgaatta 

tacactcaacaataccaaaaataccaatgtaacattaagcaagaaaagga 

aaagaagatttcttggctttttgttaggtgttggatctgcaatcgccagt • 

ggcattgctgtatctaaggtcctgcacctagaaggggaagtgaacaaaat 

caaaagtgctctactatccacaaacaaggctgtagtcagcttatcaaatg 

gagttagtgtc'ttaaccagcaaagtgttagacctcaaaaactatatagat 

aaacagttgttacctattgtgaacaagcaaagctgtagcatatcaaacat 

tgaaactgtgatagagttccaacaaaagaacaacagactactagagatta 

ccagggaatttagtgttaatgcaggtgtaactacacctgtaagcacttat 

atgttaacaaatagtgaattattatcattaatcaatgatatgcctataac 

aaatgatcagaaaaagttaatgtccaacaatgttcaaatagttagacagc 

aaagttactctatcatgtccataataaaggaggaagtcttagcatatgta 

gtacaattaccactatatggtgtaatagatacaccttgtt.ggaaact.gca 

cacatcccctctatgtacaaccaacacaaaggaagggtccaacatctgtt 

taacaagaaccgacagaggatggtactgtgacaatgcaggatcagtatct 
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ttcttcccactagctgaaacatgtaaagttcaatcgaatcg'agtattttg 
tgacacaatgaacagtttaacattaccaagtgaagtaaatctctgcaaca 
ttgacatattcaaccccaaatatgattgcaaaattatgacttcaaaaaca 
gatgtaagcagctccgttatcacatctctaggagccattgtgtcatgcta 
tggcaaaactaaatgtacagcatccaataaaaatcgtggaatcataaaga 
cattttctaacgggtgtgattatgtatcaaataagggggtggacactgtg 
tctgtaggtaatacattatattatgtaaataagcaagaaggcaaaagtct 
ctatgtaaaaggtgaaccaataataaatttctatgacccattagtgttcc 
cctctgatgaatttgatgcatcaatatctcaagtcaatgagaagattaac 
cagagcctagcatttattcgtaaatccgatgaattattacataatgtaaa 
tgctggtaaatccaccacaaatatcatgAACAATGAGTT'TATGGAAGTTA 
CAGAAAAGATCCAAATGGCATCGGATAATATTAATGATCTT^ATACAGTCA 
GGAGTGAATAC/y\GGCTTCTTACAATTCAGAGTCATGTCCAGAATTATAT 
ACCaATATCATTGACACAACAAATGTCGGATCTTAGGTVAATTCATTAGTG 
AAATTACAATTAGGAATGATAATCAAGAAGTGCCTCCACAAAGAATAACA 
CATGATGTGGGCATAAAACCTTTAAATCCAGATGATTTTTGGAGATGCAC 
GTCTGGTCTTCCATCTTTAATGAAAACTCCAAAAATAAGGTTAATGCCGG 
GGCCGGGATTATTAGCTATGCCAACGACTGTTGATGGCTGTGTTAGAACT 
CCGTCCTTAGTTATAAATGATCTGATTTATGCTTATACCTCaAATCTAAT 
TACTCGAGGTTGCCAGGATATAGGAAAATCATATCAAGTATTACAGATAG 
GGATAATAACTGTAAACTCAGACTTGGTACCTGACTTAAATCCTAGGATC 
TCTCATACTTTCAACATAT^ATGACAATAGAAAGTCATGTTCTCTAGCACT 
CCTAAAtACAGATGTATATCAACTGTGTTCGACTCCCAAAGTTGATGAAA 
GATCAGATTATGCATCATCAGGCATAGAAGATATTGTACTTGATATtGTC 



DEC 04 2000.18:52 



PAGE . 97 



WO 00/18929 PCT/EP99/07004 

43/73 

AATCATGATGGTTCAATCTCAACAACAAGATTT7VAGAACAATAATATAAG 

TTTTGATCAACCATATGCGGCATTATACCCATCTGTTGGACCAGGGATAT 

/ 

ACTACAAAGGCAAAATAATATTTCTCGGGTATGGAGGTCTTGAACATCCA 
ATAAATGAGAATGCAATCTGCAACACAACTGGGTGTCCCGGGAAAACGCA 
GAGAGACTGCAATCAGGCATCTCATAGTCCcTGGTTTTCAGACAGAAGGA 
TGGTCAACTCCATTATTGTTGTTGACAAGGGCTTAAACTCAATTCCAAAA 
CTGAAGGTATGGACGATATCCATGAGACAAAATTACTGGGGGTCAGAAGG 
AAGGCTACT.TCTACTAGGTAACAAGATCTATATATATACAAGATCTACAA 
GTTGGCATAGCAAGTTACAATTAGGAATAATTGATATTACTGATTACAGT 
GATATAAGAATAAAATGGACATGGCATAATGTGtTATCAAGACCAGGAAA . 
CAATGAATGTCCATGGGGACATTCATGtCCAGATGGATGTATAACAGGAG 
TATATACTGATGCATATCCgCTCAATCCCACAGGGAGCATTGTGTCATCT 
GTCATATTAGACTCGCAAAAATCGAGAGTAAACCCAGTCATAACTTACTC 
AACAtCAACTGAAAGGGTAAACGAGCTGGCCATCCGAAACAAAACACTCT 
CAGCTGGATATACAACAACGAGCTGCATTACACACTATAACAAAGGATAT 
TGTTTTCATATAGTAG7VAATAAATCATAA7\AGCTTAGACACATTCCAACC 
TATGTTGTTCAAAACAGAGATTCCAAAAAGCTGCAGTTAA 
[SEQ ID NO: 55] 
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\ 




3090 Humanised nucleic acids sequence of FRSVHNPiV3 



GGGTGGTACACtAGtGTGATCACCATCGAGCTGAGCAACATCAAGGAGAA 
CAAGTGCAACGGCACCGACGCCAAGGTGAAGCTGATCAAGCAGGAGCTGG 
ACAAGTACAAGAGCGCCGTGACCGAGCTGCAGCTGCTGATGCAGAGCACC 
CCCGCCACCAACAACagaGCCAGGCGCGAGCTGCCCAGGTTCATGAACTA 
CACCCTCAACAACACCAAGAACACCAACGTGACCCTGAGCAAGAAGcGcA 
AGaggCGcTTCCTGGGCTTCCTGCTGGGCGTGGGCTCCGCCATCGCCAGC 
GGCATCGCGGTGTCCAAGGTCCTGCACCTGGAGGGGGAGGTGAACAAGAT 
CAAGAGCGCCCTGCTCTCCACCAACAAGGCGGTGGTCAGCCTGTCCAACG 



GCGTGAGCGTGCTGACCAGCAAGGTGCTGGACCTCAAGAACTACATCGAC 
AAGC Aa t TGC T CCCC ATCGT G AACAAGC AGt cCT GCAGC AT CTCT AACAT 



TGAGACCGTGATCGAGTTCCAGCAGAAGAACAACAGGCTGCTGGAGATCA 



ATGCTGACCAACAGCGAGCTGCTGTCCCTGATCAACGACATGCCCATCAC 
CAACGACCAGAAGAAGCTtATGTCCAACAACGTGCAGATCGTGCGCCAGC 
AGAGCTACagCATCATGagCATCATCAAGGAGGAGGTGCTGGCCTACGTG 



CACCTCCCCCCTGTGCACCACCAACACCAAGGAGGGCTCCAACATCTGCC 
TGACCCGCACCGACCGGGGCTGGTACTGCGACAACGCCGGCTCCGTGTCC 
TTCTTCCCCCTGGCGGAGACCTGCAAGGTGCAGTCCAACCGCGTGTTCTG 
CGACACCATGAACAGCCTGACCCTGCCCAGCGAGGTGAACCTCTGCAACA 



CCAGGGAGTTCAGCGTGAACGCgGGcGTcACCACCCCGGTGAGCACCTAC 



GTGCAGCTGCCCCTGTACGGCGTGATCGACACCCCCTGCTGGAAGCTGCA 
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TCGACATCTTCAACCCCAAGTACGACTGCAAGATtATGacct ccaagacc 

gacgtgagcagctccgtgatcacctccctgggcgccatcgtgtcctgcta 

cggcaagaccaagtgtacagcctccaacaagaaccgcggcatcatcaaga 

j 

ccttctccaacggctgcgactacgtgtccaacaagggcgtggacaccgtg 

t ccgtgggcaacaccctgtactacgtgaacaagcaggagggcaagagcct 

gtacgtgaagggcgagcccatcatcaactt ctacgacccgctggtgtztcc 

cctccgacgagttcgacgcctccatctcccaggtgaacgagaagatcaac 

cagagcctggccttcatccgcaagtccgacgagctgctgcacaacgtgaa 

cgccggcaagtccaccaccaacatcatgaacaacga v gttcatggaggtga 

ccgagaagatccagatggcctccgacaacatcaacgacctgatccagtcc 

ggcgtgaacacccggctgctgaccatccagagccacgtgcagaactacat 

ccccatctccctgacccagcagatgtccgacctgcggaagttcatcagcg 

agatcaccatccggaacgacaaccaggaggtgcccccccagaggatcacc 

cacgacgtgggcataaagcccctgaaccccgacgacttctggcgctgcac 

ctccggcctcccctccctgatgaagacccccaagataaggctgatgcccg 

ggcccggcctgctggccatgcccaccaccgtggacggctgcgtgcgcacc 

ccctccctggtgatcaacgacctgatctacgcctacacctccaacctgat 

cacccgcggctgccaggacatcggcaagtcctaccaggtgctgcagatcg 

gcatcatcaccgtgaactccgacctggtacccgacctgaacccccggatC" 

tcccacaccttcaacatcaacgacaacaggaagtcctgctccctggccct 

cctgaacaccgacgtgtaccagctgtgctccacgcccaaggtggacgagc 

gctccgactacgccagctccggcatcgaggacatcgtgctggacatcgtc 

aaccacgacggctccatctccaccacccgcttcaagaacaacaacatcag 

cttcgaccagccctacgccgccctgtacccctccgtgggccccggcatct 



DEC 04 2000 18:53 



PAGE. 100 



WO 00/18929 v PCT/EP99/07OO4 



46/73 

act'acaagggcaagatcatcttcctgggctacggcggcctggagcacccc 
atcaacgagaacgccatctgcaacaccaccgggtgccccggcaagaccca 
gcgggactgcaaccaggcctcccacagcccctggttctccgaccgccgca 
tggtgaactccatGatcgtggtggacaagggcctgaactccatccccaag 
ctgaaggtgtggaccatctccatgcggcagaactactggggctccgaggg 
ccgcctgctgctgctgggcaacaagatctacatctacacccgctccacca 
gctggcacagcaagctgcagctgggcatcatcgacatcaccgactacagc 
gacatccgcatcaagtggacctggcacaacgtgctgagccggcccggcaa 
caacgagtgcccctggggccactcctgccccgacggctgcatcaccggcg 
tgtacaccgacgcctaccccctgaaccccaccggcagcatcgtgagctcc 
gtgatcctggactcccagaagtcccgggtgaaccccgtgatcacctacag 
cacctccaccgagcgcgtgaacgagctggccatccgcaacaagaccctga 
gcgccggctacaccaccaccagctgcatcacccactacaacaagggctac 
tgcttccacatcgtggagatcaaccacaagagcctggacaccttccagcc 
eatgctgttcaagaccg^agatccccaagagctgcagctaa 



[SEQ ID NO : 56] 
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Fig. 34 A 
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Fig. 34B: Humanization impact on the level of expression of F RSV HN PiV3 
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Fig 35: Codon usage of F MuV H Mv and highly expressed human genes (hum high exp) 
The frequencies (xlOO) of the individual codbns are shown for each of the 
degenerately encoded amino acids, and the most prevalent codon is shown in bold. 
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Fig 36: Schematic diagram of the PCR synthesis of each fragment in which X and 
Y are restriction sites that allow retrieval of the full size fragment from the cloning 
vector. 
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Fig 37 : Sequence of the 12 oligonucleotides from which PCR fragment A was 
generated. 



l)oli 1 FmuvHmv 1-98, horn ARN 

aatctagaccaccATGAAGGCGTTCCCCGTGATCTGCCTGGGCTTCGCCATCTTCTCCAG 

_„+ „ + -+ — — + — + 60 

C AGC ATCT GC GT GAACATCAACATCCTGCAGCAGATCG 
___„„__+ „__+_— ——+— ----- 98 ■ . , 



2) oli 2 FmuvHmv 82-181, inv comp ARN 

GTTGGGCAGCAGCTTGACCACCACGTAGGAGCTGGAGCTCTGGGAGTAGTAGCtCAGCTG 

_ ; . + - - + : - --- +" — — • + - . ---- + ~f 60 

CCTGAGCTGCTGCTTGATGTATCCGATCTGCTGCAGGATG 

+ _ . : + — +- — — + 100 

3) oli 3 FmuvHmv, 166-264 horn ARN 

CAAGCTGCTGCCCAAGATCCAGCCCACCGACAACAGCTGCGAGTTCAAGAGCGTGACCCA 

. + +- + _4— — — + 60 

GTACAACAAGACCCTGAGCAACCTGCTGCTGCCCATCGC 
- + -- — - — •+--; — — — 99 



4) oli 4 FmuvHmv, 250-352, inv comp ARN 

CAGGGCGGCGATGCCGATGGCGATGCCGGCGAACCGCTTGTGCGGCCGGGAGCCGGGGGA 

___„+—_ — — + - -r-+ : — — — 60 

GGGGGAGGTGATGTTGTTGATGTTCTCGGCGATGGGCAGCAGC 

.____„+ — . +~ — - + - — .103' 



5) oli 5 FmuvHmv, 338-441, horn ARN . & x 

GGCATCGCCGCCCTGGGCGTGGCCACCGCCGCCCAGGTGACCGCCGCCGTGTCCCTGGTG 

____„+_____ _+__„——+— — — — + ( -+— -'- + 60 

CAGGCCCAGACCAACGCCCGCGCCATCGCCGCCATGAAGAACTC ' v 

. 4-_- : 4- - — + -K 104 



6) oli 6 FMuvHmv, 427-523, inv comp ARN 

GTCCTGGATGGCCTGCACGGCGATGGCCAGCTGCTGGGTGCCCTCCTTCACCTCGAACAC 

—-4- + r ~ — + — — " + • + " + 60 

GGCGCGGTTGGTGGCGTGGATGGAGTTCTTCATGGCG 
4-- „__4-— - — — "97 



7) oli 7 FMUVHM 509-610, horn ARN 

CAGGCCATCCAGGACCACATCAACACCATCATGAACACCCAGCTGAACAACATGTCCTGC 

; —4- +—- - -_-+ — — — — + -+ ~ — + 60 

CAGATCCTGGACAACCAGCTGGCCACCTCCCTGGGCCTGTAG 
— + 4- + + — 102 
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a\ nii R FMUHM. 595-691., inv comp ARN 

GGACCGCAGGGCCTGGATACTGATGGGggaCAGGGCGGGGTTGATCAGCTGGGGCTGGAA 

+ + + + + + 

CACGGTGGTCAGCTCGGTCAGGTACAGGCCCAGGGAG 

+ + : + 97 



9) oli 9, 677-778, hom ARN 



CAGGCCCTGCGGTCCCTGCTGGGCAGCATGACCCCCGCCGTGGTGCAGGCCACCCTGAGC 

+ + + + + " DU 

ACCTCCATCAGCGCCGCCGAGATCCTGAGCGCCGGCCTGATG 
m + ^ h H 102 



im ' niH 10 FmuvHmv, 763-862, inv comp ARN 

+ +- + ■ + 60 



GTTGGACTGGGTCACGATGGTGGGCACGTTGAT 
- + — * 

CAGCACGGACACGATCTGGCCCTCCATCAGGCCGGCGCTC 



+ + — ~ + — 7~ + 100 



homARN 



11 \ n i'v 11 FmuvHmv, 8 48-949, 

GTGACCCAGTCCAACGCCCTGGTGATCGACTTCTACAGCATCAGCAGCTTCATCAACAAC 

_. . " + + +— : . --+ " + 

CAGGAGTCCATCATCCAGCTGCCCGACCGCATCCTGGAGATC 

. + 



+ — — + _—-+-- 102 



1?> oli 12 FMUHM, 935-1039, inv compARN 

GCTCAGCCGCTCGGCCTCGTTGTACTGGCAGAAGATGTGGTGGCGGGTCAGCTTGCAGTT 

+ _+ + '-+ + _+ tou 

CTTGGCGGGGTAGCGCCACTGCTCGTTGCCGATCTCCAGGATGCG 

+ + — : — +— + 105 

[SEQ ID NOS: 57-68 respectively] 
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Fig. 38 : Sequence of the 9 oligonucleotides from which PCR fragment B was 
generated. 

12) oli 12 FMUHM, 935-1039, inv compARN 

GCTCAGCCGCTCGGCCtCGTTGTAeTGGCAGAAGATGTGGTGGCGGGTCAGCTTGCAGTT 

„_+ h +- • — +— — +- + 60 

CTTGGCGGGGTAGCGCCACTGCTCGTTGCCGATCTCCAGGATGCG 

---+ - - '--+- — -+ : -+ "~ 105 

13) oli 13, 1025-1129, hom ARN 

- GCCGAGCGGCTGAGCCTGGAGACCAAGCTGTGCCTGGCCGGCAACATCAGCGCCTGCGTG 

+ r _!- __+_ — . + - +-- 60 



+ -< 

TTCTCCAGCATCGCCGGCAGCTACATGCGCCGCTTCGTGGCCCTG 

. _ H ; 



+__„„—+ — ———+- — -- 105 



- ■ ■ - * 

; , 1 

14) oli 14, 1115-1216, inv comp ARN 

• GGCGTGGTGGTCGGGCTGGTAGATGGGGTAGGAGGGGCTCTTGCACAGGCAGGTCAGGCT 

+ ; -: _„+.-__-l_ _+— — + — — — + ---—— — + 60 

GCGGCAGTTGGCCACGATGG.TGCCGTCCAGGGCCACGAAGCG 

____+_ — ~ +— — + — — +-'- 102. 

15) oli 15, 1202-1299, horn ARN ' 

CCCGACCACCACGCCGTGACCACCATCGACCTGACCTCCTGCCAGACCCTGAGCCTGGAC ■ 

„__+ +——-—+———-+ — —+ 60 

GGCCTGGACTTCAGCATCGTGTCCCTGAGCAACATCAC 
> — + -+--— — + — " ~ 98 

'' ■ ■ . j. 

16) oli 16 1285-1387, inv comp ARN 

CTTGCTCAGCTCGGTGGAGATGTGGATGGGCTGGGTGTTGATGGTCTGGCTCAGGCTGAT 

„^ h -+---- „ + + -- — - — + 60 

GGTCAGGTTCTCGGCGTAGGTGATGTTGCTCAGG 

_ +— — +— — + 94 \ 



17) oli 17, 1363-1462, hom /ARN 

CACCGAGCTGAGCAAGGXGAACGCCTCCCTGCAGAACGCCGTGAAGTACATCAAGGAGAG 

~+— . -+ : +—.———+- -+ • + 60 

CAACCACCAGCTGCAGAGCGTGAGCGTGAGCAGCAAGCGC , - ' 

+ + : + -+ 100 . 



18) oli 18, 1447-1550, inv comp ARN 

TCACCTGGTGCTCGATGGAGTTGGTCACGTCCAGGTTGGTGCTCAGGCTCTTGTGGATGT 

_+„_ — + .__+_. - — + + — + 60 

CGGCGGTGTAGATGGCGGCGCGGTGCAGGCGCTTGCTGGTCACG 

+ ._+„.— -+ 104 . 
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19) oli 19, 1534-1636, horn ARN 

CATCGAGCACCAGGTGAAGGACGTGCTGACCCCCCTGTTCAAGATCATCGGCGACGAGGT 

+ _+ — — + + + 60 

GGGCCTGCGCACCCCCCAGCGCTTCACCGACCTGGTGAAGTTC 
+ . + + + 103 

20) oli 20 FmuvHmv, 1622-1718, inv comp ARN 

GCTCGGGGGGGTTGATGCACCAGGTCAGGTCGCGGAAGTCGTACTCGCGGTCGGGGTTCA 

„ + . — + + + + + 60 

GGAACTTGATCTTGTCGGAGATGAACTTCACCAGGTC 
. — + -+ +— 97 



[SEQ ID NOS: 69-77 respectively] 
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Fig. 39 : Sequence of the 11 oligonucleotides from which PCR fragment C was 
generated. 

20) oli 20 FmuvHrnv, 1622-1718, inv corap ARN 

GCTCGGGGGGGTTGATGCACCAGGTCAGGTCGCGGAAGTCGTACTCGCGGTCGGGGTTCA 

+ + + — + — ■ + + 60 

GGAACTTGATCTTGTCGGAGATGAACTTCACCAGGTC 
. +. r — + + 97 

21) oli 21, FmuvHmv, 1701-1799, horn ARN 

GCATCAACCCCCCCGAGCGGATCAAGCTGGACTACGACCAGTACTGCGCCGACGTGGCCG 

+ + +- + + - : 60 

CCGAGGAGCTGATGAACGCCCTGGTGAACAGCACCCTGC 
+ 99 

22) oli 22, 17B4-1888, inv comp 

CATGTTGCTGAACTGGCCCCGGATGGTGGTGGGGCCGCTGCAGTTGCCCTTGCTCACGGC 
+ + +-- +- + + 60 

CAGGAACTGGTTGGTGGTGCGGGTCTCCAGCAGGGTGCTGTTCAC 
+ . + + + 105 

23) oli 23, 1874-1971, horn ARN 

CAGTTCAGCAACATGAGCCTGTCCCTGCTGGACCTGTACCTGGGCCGGGGCTACAACGTG 

+ + + + + 60 

AGCAGCATCGTGACCATGACCAGCCAGGGCATGTACGG 
— :-- + + ■ — ■ — +- 98 



24) oli 24, 1957-2057, inv. comp ARN 

CCACCTCGAACACGCGGTACATGCTCAGCTGGCTCAGCTCGCTCCGCTTGCTGCTCAGGT 

:__ + + + + — + — + 60 

TGGGCTTCTCCACCAGGTAGGTGCCGCCGTACATGCCCTGG 

+ + 4- : + - 101 

25) oli 25, FmuvHrnv, 2043-214 0, homARN 

GCGTGTTCGAGGTGGGCGTGATCCGGT^ACCCCGGCCTGGGCGCCCCCGTGTTCCACATGA 
+ f + + + + 60 

CCAACTACCTGGAGCAGCCCGTGAGCAACGACCTGAGC 
+ -f— + - 98 

26) oli26, FmuvHrnv, 2125-2227, inv comp ARN 

GCCGCTGCCCTGGTAGGGGATGGTGATGCTGTCCTCGCCGTGGCACAGGGCGGCCAGCTT 

+ + + + + + 60 

CAGCTCGCCCAGGGCCACCATGCAGTTGCTCAGGTCGTTGCTC 
+ + + + 103 

27) oli 27, 2212-2309, FmuvHm, horn ARN ^ 

GTACCAGGGCAGCGGCAAGGGCGTGAGCTTCCAGCTGGTGAAGCTGGGCGTGTGGAAGAG 
+ -+ + + + — + 60 

CCCCACCGACATGCAGAGCTGGGTGCCCCTGAGCACCG 
-+ + 98 
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28) oli 28,E*iuvHrov, 2294-2392, inv comp ARN 

GGTGGGCACGGCCCACTTGGCCTGGTTGTCGGCGATCACGCCGCGGTGGCTGCTCAGGTA 

+ + : + + + + 60 

CAGGCGGTCGATCACGGGGTCGTCGGTGCTCAGGGGCAC 
+ ~ + + 99 

29) oli 29, Fmuv Hmv, 2377-2477, horn ARN 

GTGGGCCGTGCCCACCACCCGCACCGACGACAAGCTGCGCATGGAGACCTGCTTCCAGCA 

+ ; + + + + + 60 

GGCCTGCAAGGGCAAGATCCAGGCCCTGTGCGAGAACCCCG 
+ + + — + - 101 

30) oli 30*, FmuvHmv 2462-2561, inv comp 

TGATCTTCAGCTCCACGGTCAGGCTCAGGTCCACGCTCAGCACGCCGTAGCTGGGGATGC 

_ + + + + + + 60 

GGTTGTCCTTCAGGGGGGCCCAtTCGGGGTTCTCGCACAG 
+ + + + 100 

[SEQ ID.NOS: 78-88 respecively] 
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Fig. 40 : Sequence of the 8 oligonucle tides from which PCR fragment D was 
generated. 

30) oli 30, FmuvHmv 2462-2561, inv comp 

TGATCTTCAGCTCCACGGTCAGGCTCAGGTCCACGCTCAGCACGCCGTAGCTGGGGATGC 
4- ~ — + + : — ~ + — + 60 

GGTTGTCCTTCAGGGGGGCCCAtTCGGGGTTCTCGCACAG 
: 4- + + + 100 



31) oli31, FmuvHmv, 2546-2649, horn ARN 

GTGGAGGTGAAGATCAAGATCGCGAGCGGCTTCGGCCCCCTGATCACCCACGGCAGCGGC 

+ -. + h '■ — + 60 

ATGGACCTGTACAAGAGCAACCACAACAACGTGTACTGGCTGAC 
- + + . 4- +• 104 



32) oli 32, FmuvHmv, 2635-2738, inv comp ARN 

CGGTGAACAGGTAGGGGCTCACCTTGAAGCGGGGaATCCACTCCAGGGTGTTGATCACGC 

_ + 4- + + 4*- + 60 

CCAGGGCCAGGTTCTTCATGGGGGGGATGGTCAGCCAGTACACG 
+ _+ + + 104 



33) oli 33, FmuvHmv, 2723-2827, hom ARN 

CCCTACCTGTTCACCGTGCCCATCAAGGAGGCCGGCGAGGACTGCCACGCCCCGACCTAC 

+ — 4--- ■ 4- + + — - — + 60 

CTGCCCGCCGAGGTGGACGGCGACGTGAAGCTGAGCAGCAACCTG 

„' + .+ : +- +- 105 



34) oli 34 f FMUVHmv, 2813-2911, inv comp ARN 

CACGTAGTACACCACGGCGTGCTCCACGCGGCTGGTGTCGTAGGTGGCCAGGACGTACTG 

4. _ -4. 4- + + + 60 

CAGGTCCTGGCCGGGCAGGATCACCAGGTTGCTGCTCAG 
+ + +■ — = 99 



35) oli 35 FMUHM, 2897-2995, homARN — ~ 

GTGGTGTACTACGTGTACAGCCCCGGCCGCAGCTTCTTCTAGTTCTACCCCTTCCGCCTG 
+ + + + : 4- — —4- 60 

CCCATCAAGGGCGTGCCCATCGAGCTGCAGGTGGAGTGC 

— : + _ + 4- 99 



36) oli 36, FmuvHmv, 2981-3078, inv comp ARN 

CCGCTGTGGGTGATGTGGCCGCCGCTCTCGCTGTCGGCCAGCACGCAGAAGTGGCGGCAC 

^_+____ + ----4 + +" " + 60 

CACAGCTTCTGGTCCCAGGTGAAGCACTCCACCTGCAG ■ 
+ _ 4- + 98 
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37) oli 37, ■ 3064-3147, homARN 

CATCACCCACAGCGGCATGGTGGGCATGGGCGTGAGCTGCACCGTGACCCGCGAGGACGG 
; + — + + + + + 60 

CACCAACCGCCGCTAGcgaattcc 
+ + 84 



[SEQ ID NOS: 89-96 respectively] 
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Fig. 41 : Construction of pEE14F MuV hum HN M vhum 

a) PCR fragment A 
Xbal 



TspRJ 



Fj^vhum 1 -965 



pCRIITOPO!9 



b)pCR fragment B 



TspRI 



Aval 



1 






F M - V 965-1712 







pGRIITOP019 



c) PCR fragment C 



Aval 



Apal 









HMV17I2- 
24&5 







-pCRIITOP019 



d) PCR fragment D 



Apal 



EcoRI 



Hmv2485- 
U39 



pCRIITOP019 



d) pEE14 F MuV hum H MV hum 



Xbal 



EcoRI 



F MuV hum(aal -482)H Mv hum(aa59-6 1 7) 



-pEE14 



HFC t?d 2%m 18=56 
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Figure 42A : Humanised nucleic acids sequence of F^^H^y (upper 
sequence) compared to the original F MuV H MV sequence (lower sequence) and 
the corresponding amino acids sequence. 



14 ATGAAGGCGTTCCCCGTGATCTGCCTGGGCTTCGCCATCTTCTCCAGCAG 63 

I | | | I I I I II II II II II I I I I I I I I II Mill II I 

1 ATGAAGGCTTTTCCAGTTATTTGCTTGGGCTTTGCAATCTTTTCATCCTC 50 

. - • • 

6 4 CATCTGCGTGAACATCAACATCCTGCAGCAGATCGGATACATCAAGCAGC 113 

II || Mill II II I Ml M I I II I M II II II I I II II I I I 
5 1 TATATGTGTGAATATCAATATCTTGCAGCAAATTGGATACATCAAGCAAC 10 0 

- • • • * 

114 AGGTGAGGCAGCTGAGCTACTACTCCCAGAGCTCCAGCTCCTACGTGGTG 163 

MM II I I I II II I II I M I I II II II I II II II I I I! Ml 
101 AGGTCAGGCAACTAAGCTATTACTCACAAAGTTCAAGCTCCTACGTAGTG 150 

164 GTCAAGCTGCTGCCCAACATCCAGCCCACCGACAACAGCTGCGAGTTCAA 213 

II I I II II I II II Mill II I II II I I II I II I II M II 

151 G T C AAG CT T T T AC CG AAT AT C C AAC C C ACT G AT AAC AG CT G T G AAT T T AA 200 

214 GAGCGTGACCCAGTACAACAAGACCCTGAGCAACCTGCTGCTGCCCATCG 2 63 

Ml I I II M Mill II I III I II I II I I I I II II M I 
2 01 GAGTGTAACTCAATACAATAAGACCTTGAGTAATTTGCTTCTTCCAATTG 250 

• - - « 

2 64 CCGAGAACATCAACAACATCACCTCCCCCTCCCCCGGCTCCCGGCGGCAC 313 

I II I I I II I I II I M M II I II I I II M II I II II 

2 51 CAGAAAACATAAACAATATTACGTCGCCCTCACCTGGGTCAAGACGTCAT 300 

. • • - 

314 AAGCGGTTCGCCGGCATCGCCATCGGCATCGCCGCCCTGGGCGTGGCCAC 363 

II II II I II Mill Mill MM! M Mill II II II II 

301 AAACGGTTTGCTGGCATTGCCATTGGCATTGCgGCcCTCGGTGTTGCGAC 350 

364 CGCCGCCCAGGTGACCGCCGCCGTGTCCCTGGTGCAGGCCCAGACCAACG 413 

III I I II II I II II I I I M I I I II II II I II i I M I 

351 CGCAGCACAAGTGACTGCCGCTGTCTCATTAGTTCAAGCACAGACAAATG 400 

• • • » • 

414 CCCGCGCCATCGCCGCCATGAAGAACTCCATCCAGGCCACCAACCGCGCC 4 63 

I II II II II II II II I III! II M II I I I I I II II 

4 01 CACGTGCAATAGCAGCGATGAAAAATTCAATACAGGCAACTAATCGGGCA 4 50 

4 64 GTGTTCGAGGTGAAGGAGGGCACCCAGCAGCTGGCCATCGCCGTGCAGGC 513 

II I II II II II II I I M 11 M M II I I II II II II II II 

4 51 GTCTTCGAAGTGAAGGAAGGCACCCAACAGTTAGCTATAGCGGTACAAGC 500 

• • • • 

514 CATCCAGGACCACATCAACACCATCATGAACACCCAGCTGAACAACATGT 563 

I M I II I II M II I II II II II I II I II M I I M II II II II 
501 cATcCAAGACCATATCAATACTATTATGAACACCCAATTGAACAATATGT 550 



DEC 04 2030 18:57 



PPGE . 1 



WO 00/18929 



PCT/EP99/07004 



61/73 



564 CCTGCCAGATCCTGGACAACCAGCTGGCCACCTCCCTGGGCCTGTACCTG 613 

| || I I I I 1 I I I II I I I I I I I I II II 1 I I I I I II I I I I I I 
551 CTTGTCAGATCCTTGATAACCAGCTTGCAACCTCCCTAGGATTATACCTA 600 

614 ACCGAGCTGACCACCGTGTTCCAGCCCCAGCTGATCAACCCCGCCCTGtC 6 63 

M I I I I I I I I I I I I I I I I I M I M INI M MM 

601 ACAGAATTAACAACAGTGTTTCAGCCACAATTAATTAATCCAGCATTGTC 650 

664 cCCCATCAGTATCCAGGCCCTGCGGTCCCTGCTGGGCAGCATGACCCCCG 713 

I I M I II M I I I I I II I I I I I I II II II 1 II I I II I 

651 ACCGATTAGTATACAAGCCTTGAGGTCTTTGCTTGGAAGTATGACACCTG 700 

- • • 

714 CCGTGGTGCAGGCCACCCTGAGCACCTCCATCAGCGCCGCCGAGATCCTG 7 63 

I I I I I I I I I I I I I II II I I M M M II II 

701 CAGTGGTTCAAGCAACATTATCTACTTCAATTTCTGCTGCTG7VAATACTA t 7 50 

• • - 

7 64 AGCGCCGGCCTGATGGAGGGCCAGATCGTGTCCGTGCTGCTGGACGAGAT 813 

ii mini 1 1 1 ii 1 1 1 ii 1 1 1 M ii ii i M ii M 1 1 1 1 1 

7 51 AGTGCCGGTCTAATGGAGGGTCAGATAGTTTCTGTTCTGCTAGATGAGAT 8 00 

• - - 

814 GCAGATGATCGTGAAGATCAACGTGCCCACCATCGTGACCCAGTCCAACG 8 63 

I I II I 1 I I I II I I II I Mill II II I I I M M II II II I 

801 GCAGATGATAGTTAAGATAAACGTTCCAACCATTGTCACACAATCAAATG 850 

. - - 

8 64 CCCTGGTGATCGACTTCTACAGCATCAGCAGCTTCATCAACAACCAGGAG 913 

I I I I II I I I I I I I II I I If I t II I I I I II M M 

8 51 CATTGGTGATTGACTTCTACTCAATTTCGAGTTTTATTAATAATCAAGAA 900 

914 TCCATCATCCAGCTGCCCGACCGCATCCTGGAGATCGGCAACGAGCAGTG 963 

I I I I I || II II I I III I M I I I I I I M I I I II M M II 

9 01 TCCATAATTCAATTGCCAGACAGGATCTTGGAGATCGGAAATGAACAATG 950 

- 

964 GCGCTACCCCGCCAAGAACTGCAAGCTGACCCGCCACCACATCTTCTGCC 1013 

I I I I 1 I II II Mill II III I II I I I I I II I M I I II II I . 
951 GCGCTATCCAGCTAAGAATTGTAAGTTGACAAGACACCACATATTCTGCC 1000 

■ • • • 

1014 AGTACAACGAGGCCGAGCGGCTGAGCCTGGAGACCAAGCTGTGCCTGGCC 10 63 

I I I I I I I I II I III I I I I I I I I I I II II II M I I I M II • 
1001 AATACAATGAGGCAGAGAGGCTGAGCCTAGAAACAAAACTATGCCTTGCA 1050 

• * . ■ 

10 64 GGCAACATCAGCGCCTGCGTGTTCTCCAGCATCGCCGGCAGCTACATGCG 1113 

I I I I I I I I I I II I I I I I II I I I I I II I I II M I I I I 

10 51 GGCAATATTAGTGCCTGTGTGTTCTCATCTATAGCAGGGAGTTATATGAG 1100 

1114 CCGCTTCGTGGCCCTGGACGGCACCATCGTGGCCAACTGCCGCAGCCTGA 1163 

II 11 II II I I I I I II M M H II I II I I II M M I 
1101 GCGATTTGTAGCACTGGATGGAACAATTGTTGC7VAACTGTCGAAGTCTAA 1150 

1164 CCTGCCTGTGCAAGAGCCCCTCCTACCCCATCTACCAGCCCGACCACCAC 1213 

I II II I I I I III I II II II II ILIUM M MMI M 
1151 CGTGTCTATGCAAGAGTCCATCTTATCCTATATACCAACCTGACCATCAT 1200 
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. B ■ ■ - 

1214 GCCGTGACCACCATCGACCTGACCTCCTGCCAGACCCTGAGCCTGGACGG 1263 

M II I I I I I I I El II I I 11 II II MM I I I I I I I I I 
1201 GCAGTCACGACCATTGATCTAACGTCATGTCAAACATTGTCCCTGGACGG 1250 

■ * • * * 

12 64 CCTGGACTTCAGCATCGTGTCCCTGAGCAACATCACCTACGCCGAGAACC 1313 

.Mill I I I I II II II II II II I I I I II It I I I I II Mill I 
1251 ACTGGATTTCAGCATTGTCTCGCTAAGCAACATCACTTACGCTGAGAATC 1300 

a a • * • 

1314 TGACCATCAGCCTGAGCCAGACCATCAACACCCAGCCCATCGACATCTCC 13 63 

I 1 I I I II M I II I I I I I I I I I II I I I I I I I I I 

1301 TTACTATTTCATTGTCTCAGACAATCAATACTCAACCCATTGATATATCA 1350 

• ■■ * * * 

1364 ACCGAGCTGAGCAAGGTGAACGCCTCCCTGCAGAACGCCGTGAAGTACAT 1413 

II I I I I I II I I I I I r M II Mill II II IE I I I I I II I I I 
1351 ACTGAGCTGAGTAAGGTTAATGCATCCCTCCAAAATGCCGTTAAATACAT 14 0 0 

1414 CAAGGAGAGCAACCACCAGCTGCAGAGCGTGAGCGTGAGCAGCAAGCGCC 14 63 

I I I I II I I I I I I II II I I III II II 11 II II 

14 01 AAAA G A G A G T AAC CAT C AAC T C C AAT C C G T TAG T G T AAG T T C T AAAAG AC 14 50 

14 64 TGCACCGCGCCGCCATCTACACCGCCGAGATCCACAAGAGCCTGAGCACC 

■ I I I I II I I I I III M I I I II I II III II II I II I I I I I I I I 
14 51 TTCATCGGGCAGCCATCTACACCGCAGAGATCCATAAAAGCCTCAGCACC 

• • • • * 

1514 AA C C T G G A C GT G AC C AAC T C CAT C GAG C AC C AG G T G AAG G A CG T G C T G AC 

II II II I I II I II II II I I I II I I I I I I I I I I M I M I I I I I 
1501 AATCTAGATGTAACTAACTCAATCGAGCATCAGGTCAAGGACGTGCTGAC 

» • • • « ■ 

1564 CCCCCTGTTCAAGATCATCGGCGACGAGGTGGGCCTGCGCACCCCCCAGC 1613 

II II Mill I I II I I II II II 'I I I I II II I I II II III 

1551 ACCACTCTTCAAAATCATCGGTGATGAAGTGGGCCTGAGGACACCTCAGA 1600 

***** 

1614 GCTTCACCGACCTGGTGAAGTTCATCTCCGACAAGATCAAGTTCCTGAAC 1663 

I Mill Mill Mill I I I II I II- II I II I II I I I I I II M 

1601 GATTCACTGACCTAGTGAAATTCATCTCTGACAAGATTAAATTCCTTAAT 1650 

• • ■ * • 

1664 CCCGACCGCGAGTACGACTTCCGCGACCTGACCTGGTGCATCAACCCCCC 1713 

II II I I II I I II I I I I I I II M I I M I I I I I I I I II I II 
1651 CCGGATAGGGAGTACGACTTCAGAGATCTCACTTGGTGTATCAACCCGCC 1700 

1714 CGAGCGGATCAAGCTGGACTACGACCAGTACTGCGCCGACGTGGCCGCCG 17 63 

III I I I I II I I II I I M I I I I I I I I I I I I M I 1 I I I 
1701 AGAGAGAATCAAATTGGATTATGATCAATACTGTGCAGATGTGGCTGCTG 1750 

17 64 AGGAGCTGATGAACGCCCTGGTGAACAGCACCCTGCTGGAGACCCGCACC 1813 

i ii i ii 1 1 1 ii it i r m M ii ii ii 1 1 1 1 1 1 1 1 1 i ii 

1751 AAG AG C T CAT GAAT GC AT T G G T G AAC T C AAC T CT A C T G G AGAC C AG AAC A 1800 

• a .a • • 

1814 ACCAACCAGTTCCTGGCCGTGAGCAAGGGCAACTGCAGCGGCCCCACCAC 18 63 

I I II I I II 1 II I I II II Mill M II I I II II M I II 

18 01 ACCAATCAGTTCCTAGCTGTCTCAAAGGGAAACTGCTCAGGGCCCACTAC 1850 



1513 
1500 
1563' ' 
1550 
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18 64 CATCCGGGGCCAGTTCAGCAACATGAGCCTGTCCCTGCTGGACCTGTACC 1913 

IN | || || III I I I I I I MINIMI I I I I I I I I 
1851 AATCAGAGGTCAATTCTCAAACATGTCGCTGTCCCTGTTAGACTTGTATT 1900 

« - • • 

1914 TGGGCCGGGGCTACAACGTGAGCAGCATCGTGACCATGACCAGCCAGGGC 1963 

| | I I I II I I I I I III II II 1 I I I I I I MINI 

1901 TAGGTCGAGGTTACAATGTGTCATCTATAGTCACTATGACATCCCAGGGA 1950 

1964 ATGTACGGCGGCACCTACCTGGTGGAGAAGCCCAACCTGAGCAGCAAGCG 2013 

I | I I I II II M M I I I I I M I I I II I II I I I I I I I I I I I I 
1951 ATGTATGGGGGAACTTACCTAGTGGAAAAGCCTAATCTGAGCAGCAAAAG 2000 

2014 GAGCGAGCTGAGCCAGCTGAGCATGTACCGCGTGTTCGAGGTGGGCGTGA 2063 

| Ml || || I I I I I I I I I I II I I I I I I I I I II II M I 
2001 GTCAGAGTTGTCACAACTGAGCATGTACCGAGTGTTTGAAGTAGGTGTTA 2050 

. ^ . * • 

2 064 TCCGGAACCCCGGCCTGGGCGCCCCCGTGTTCCACATGACCAACTACCTG 2113 

M i nil it 1 1 1 i 1 1 1 1 i ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

2 0 51 TCAGAAATCCGGGTTTGGGGGCTCCGGTGTTCCATATGACAAACTATCTT 2100 

• * * ~ ■ * 
2114 GAGCAGCCCGTGAGCAACGACCTGAGCAACTGCATGGTGGCCCTGGGCGA 2163 

I I I I I II II II II II II I I I I I I I I I I I I I I I I I I I I M 
2101 GAGCAACCAGTCAGTAATGATCTCAGCAACTGTATGGTGGCTTTGGGGGA 2150 

2164 GCTGAAGCTGGCCGCCCTGTGCCACGGCGAGGACAGCATCACCATCCCCT 2213 

Ml || II I I I M II II I I I I I II II I I I M II I I I I 

2151 GCTCAAACTCGCAGCCCTTTGTCACGGGGAAGATTCTATCACAATTCCCT 2200 

2214 ACCAGGGCAGCGGCAAGGGCGTGAGCTTCCAGCTGGTGAAGCTGGGCGTG 22 63 

I Mill II II II II I I I I I I I I I I I I I Mill II II 

22 01 ATCAGGGATCAGGGAAAGGTGTCAGCTTCCAGCTCGTCAAGCTAGGTGTC 2250 

■ . • - - - 

22 64 TGGAAGAGCCCCACCGACATGCAGAGCTGGGTGCCCCTGAGCACCGACGA 2 313 

Mill I I I II M M I II I I I I I I I I Ml I M I I I I 

22 51 TGGAAATCCCCAACCGACATGCAATCCTGGGTCCCCTTATCAACGGATGA 2300 

• • ■ • 

2314 CCCCGTGATCGACCGCCTGTACCTGAGCAGCCACCGCGGCGTGATCGCCG 2363 

. I I I M I I I I I I I I ! I I I I II I I I I I I I I I I I I 

2301 TCCAGTGATAGACAGGCTTTACCTCTCATCTCACAGAGGTGTTATCGCTG 2350 

. • • • • 

2364 ACAACCAGGCCAAGTGGGCCGTGCCCACCACCCGCACCGACGACAAGCTG 2413 

I M II II II II I I I I I II II II II II II II I 1 I I I I II 
2351 ACAAcCAAGCAAAATGGGCTGTCCCGACAACACGAACAGATGACAAGTTG 24 00 

• - • • * * 
2414 CGCATGGAGACCTGCTTCCAGCAGGCCTGCAAGGGCAAGATCCAGGCCCT 24 63 

II II I I I M I MINIM | I I II II I I I I I 1 I I I I I I I I i I 

24 01 CGAATGGAGACATGCTTCCAACAGGCGTGTAAGGGTAAAATCCAAGCACT 2 4 50 

• - - - * 

24 64 GTGCGAGAACCCCGAaTGGGCCCCCCTGAAGGACAACCGCATCCCCAGCT 2513 

I I I I I I I I I I I I I I II 1 I II I I I I 11 1 III I II M I 
2451 CTGCGAGAATCCCGAGTGGGCACCATTGAAGGATAACAGGATTCCTTCAT 2500 
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2514 ACGGCGTGCTGAGCGTGGACCTGAGCCTGACCGTGGAGCTGAAGATCAAG 2563 

| M I II II N II Mill I I I I I I I MMI II Mil 
2501 ACGGGGTCTTGTCTGTTGATCTGAGTCTGACAGTTGAGCTTAAAATCAAA 2550 

2564 ATCGCGAGCGGCTTCGGCCCCCTGATCACCCACGGCAGCGGCATGGACCT 2 613 

| | | | I I I ! I I I I I I I I I I I I I I I I I I I M I I I I I I 

2551 ATTGCTTCGGGATTCGGGCCATTGATCACACACGGTTCAGGGATGGACCT 2 60 0 

2 614 GTACAAGAGCAACCACAACAACGTGTACTGGCTGACCATCCCCCCCATGA 2663 

| | | | I III I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2601 ATACAAATCCAACCACAACAATGTGTATTGGCTGACTATCCCGCCAATGA 2650 

„ . « • • • • 

2 6 64 AGAACCTGGCCCTGGGCGTGATCAACACCCTGGAGTGGATtCCCCGCTTC 2713 

|| | | | | | I I I I I I I I II I I I I II I I II I I II I I M I Ml 
2 651 AGAACCTAGCCTTAGGTGTAATCAACACATTGGAGTGGATACCGAGATTC 2700 

- . - 

2714 AAGGTGAGCCCCTACCTGTTCACCGTGCCCATCAAGGAGGCCGGCGAGGA 2763 

.1 I M I I I I I I I II II Mill II II II I II I I I I Mill II 
27 01 AAGGTTAGTCCCTACCTCTTCAcTGTCCCAATTAAGGAAGCAGGCGAAGA 27 50 

27 64 CTGCCACGCCCCGACCTACCTGCCCGCCGAGGTGGACGGCGACGTGAAGC 2813 

M | I I I I I I I I II MIM M M I 1 I I I I ! I I I II II II I 
2751 CTGCCATGCCCCAACATACCTACCTGCGGAGGTGGATGGTGATGTCAAAC 2800 

■ . . - 

2814 TGAGCAGCAACCTGGTGATCCTGCCCGGCCAGGACCTGCAGTACGTGCTG 28 63 

III ill I I I I I II I I I I I II I I I I I I I I I I I I M 

28 01 TCAGTTCCAATCTGGTGATTCTACCTGGTCAAGATCTCCAATATGTTTTG 2 8 50 

. ■ . - - * 

28 64 GCCACCTACGACACCAGCCGCGTGGAGCACGCCGTGGTGTACTACGTGTA 2913 

,|| || || || I I II I I M II II II I II I I II I I I I I M 
2 8 51 GCAACCTACGATACTTCCAGGGTTGAACATGCTGTGGTTTATTACGTTTA 2900 

• " - 

2914 CAGCCCCGGGCGCAGCTTCTTCTACTTCTACCCCTTCCGCCTGCCCATCA 2963 

MINI I II I I I II I I I I I I I II M II I I I I I I I I 
2901 CAGCCCAgGCCGCTCATTTTtTTACTTTTATCCTTTTAGGTTGCCTATAA 2 950 

2964 AGGGCGTGCCCATCGAGCTGCAGGTGGAGTGCTTCACCTGGGACCAGAAG 3013 

I I I I II I I I I I I II I II I II I I I I I I I I I I I I I I I I M II 

2951 AGGGGGTCCCCATCGAATTACAAGTGGAATGCTTCACATGGGACCAAAAA 300 0 

- 

3014 CTGTGGTGCCGCCACTTCTGCGTGCTGGCCGACAGCGAGAGCGGCGGCCA- 3063 

II II I I 11 I I I I I I I I I I MM! II III II II II II 
3001 CTCTGGTGCCGTCACTTCTGTGTGCTTGCGGACTCAGAATCTGGTGGACA 3050 

- • • - 

3064 CATCACCCACAGCGGCATGGTGGGCATGGGCGTGAGCTGCACCGTGACCC 3113 

I I I I I I I t II I I I I I I I I I I I I M M I 1 I M I M II Mil 
3051 TATCACTCACTCTGGGATGGtGGGCATGGGAGTCAGCTGCACAGTCACCC 3100 

/ 

3114 GCGAGGACGGCACCAACCGCCGCTAG 3139 

I II M M MIM III I III 
3101 GGGAAGATGGAACCAATCGCAGATAG 312 6 ^ 
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Fig. 42B : F MU H MV -seq check: 4381 from: 1 to: 3126 
nucleic acid sequence of F MU H MV (non humanised) 

ATGAAGGCTTTTCCAGTTATTTGCTTGGGCTTTGCAATCTTTTCATCCTC 
TATATGTGTGAATATCAATATCTTGCAGCAAATTGGATACATCAAGCAAC 
AGGTCAGGCAACTAAGCTATTACTCACAAAGTTCAAGCTCCTACGTAGTG 
GTCAAGCTTTTACCGAATATCCAACCCACTGATAACAGCTGTGAATTTAA 
GAGTGTAACTCAATACAATAAGACCTTGAGTAATTTGCTTCTTCCAATTG 
CAGAAAACATAAACAATATTACGTCGCCCTCACCTGGGTCAAGACGTCAT 
AAACGGTTTGCTGGCATTGCCATTGGCATTGCgGCcCTCGGTGTTGCGAC 

* ■ * 

CGCAGCACAAGTGACTGCCGCTGTCTCATTAGTTCAAGCACAGACAAATG 
CACGTGCAATAGCAGCGATGAAAAATTCAATACAGGCAACTAATCGGGCA 
GTCTTCGAAGTGAAGGAAGGCACCCAACAGTTAGCTATAGCGGTACAAGC 
cATcCAAGACGATATCAATACTATTATGAACACCCAATTGAACAATATGT 
CTTGTCAGATCCTTGATAACCAGCTTGCAACCTCCCTAGGATTATACCTA 
ACAGAATTAACAACAGTGTTTCAGCCACAATTAATTAATCCAGCATTGTC 
ACCGATTAGTATACAAGCCTTGAGGTCTTTGCTTGGAAGTATGACACCTG 
CAGTGGTTCAAGCAACATTATCTACTTCAATTTCTGCTGCTGAAATACTA 
" AGTGCCGGTCTAATGGAGGGTCAGATAGTTTCTGTTCTGCTAGAT.GAGAT 
GCAGATGATAGTTAAGATAAACGTTCCAACCATTGTCACACAATCAAATG 
CATTGGTGATTGACTTCTACTCAATTTCGAGTTTTATTAATAATCAAGAA 
TCCATAATTCAATTGCCAGACAGGATCTTGGAGATCGGAAATGAACAATG 
GCGCTATCCAGCTAAGAATTGTAAGTTGAC.AAGACACCACATATTCTGCC 
AATACAATGAGGCAGAGAGGCTGAGCCTAGAAACAAAACTATGCCTTGCA 
GGCAATATTAGTGCCTGTGTGTTCTCATCTATAGCAGGGAGTTATATGAG 
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GCGATTTGTAGCACTGGATGGAACAATTGTTGCAAACTGTCGAAGTCTAA 
CGTGTCTATGCAAGAGTCCATCTTATCCTATATACCAACCTGACCATCAT 
GCAGTCACGACCATTGATCTAACGTCATGTCAAACATTGTCCCTGGACGG 

f 

ACTGGATTTCAGCATTGTCTCGCTAAGCAACATCACTTACGCTGAGAATC 
TTACTATTTCATTGTCTCAGACAATCAATACTCAACCCATTGATATATCA 
ACTGAGCTGAGTAAGGTTAATGCATCCCTCCAAAATGCCGTTAAATACAT 
AAAAGAGAGTAACCATCAACTCCAATCCGT'TAGTGTAAGTTCTAAAAGAC 
TTCATCGGGCAGCCATCTACACCGCAGAGATCCATAAAAGCCTCAGCACC 
AATCTAGATGTAACTAACTCAATCGAGCATCAGGTCAAGGACGTGCTGAC 
ACCACTCTTCAAAATCATCGGTGATGAAGTGGGCCTGAGGACACCTCAGA 
GATTCACTGACCTAGTGAAATTCATCTCTGACAAGATTAAATTCCTTAAT 
CCGGATAGGGAGTACGACTTCAGAGATCTCACTTGGTGTATCAACCCGCC 
AGAGAGAATCAAATTGGATTATGATCAATACTGTGCAGATGTGGCTGCTG 
AAGAGCTCATGAATGCATTGGTGAACTCAACTCTACTGGAGACCAGAACA 
ACCAATCAGTTCCTAGCTGTCTCAAAGGGAAACTGCTCAGGGCCCACTAC 
AATCAGAGGTCAATTCTCAAACATGTCGCTGTCCCTGTTAGACTTGTATT 
TAGGTCGAGGTTACAATGTGTCATGTATAGTCACTATGACATCCCAGGGA 
ATGTATGGGGGAACTTACCTAGTGGAAAAGCCTAATCTGAGCAGCAAAAG 
GTCAGAGTTGTCACAACTGAGCATGTACCGAGTGTTTGAAGTAGGTGTTA 
TCAGAAATCCGGGTTTGGGGGCTCGGGTGTTCCATATGACAAACTATCTT 
GAGCAACCAGTCAGTAATGATCTCAGCAACTGTATGGTGGCTTTGGGGGA 
GCTCAAACTCGCAGCCCTTTGT.CACGGGGAAGATTCTATCACAATTCCCT 
ATCAGGGATCAGGGAAAGGTGTCAGCTTCCAGCTCGTCAAGCTAGGTGTC 
TGGAAATCCCCAACCGACATGCAATCCTGGGTCCCCTTATCAACGGATGA 



DEC 04 2000 18:59 



PAGE. 121 



•J 

WO 00/18929 PCT/EP99/07004 

/ 

\ 

67/73 

TCCAGTGATAGACAGGCTTTACCTCTCATCTCACAGAGGTGTTATCGCTG 
ACAAcCAAGCAAAATGGGCTGTCCCGACAACACGAACAGATGACAAGTTG 
CGAATGGAGACATGCTTCCAACAGGCGTGTAAGGGTAAAATCCAAGCACT 
CTGCGAGAATCCCGAGTGGGCACCATTGAAGGATAACAGGATTCCTTCAT 
ACGGGGTCTTGTCTGTTGATCTGAGTCTGACAGTTGAGCTTAAAATCAAA 
ATTGCTTCGGGATTCGGGCCATTGATCACACACGGTTCAGGGATGGACCT 
AT AC AAAT C C AAC C ACAACAAT GTGTATT GGCTG ACT AT CCC GCCAATGA 
AGAACCTAGCCTTAGGTGTAATCAACACATTGGAGTGGATACCGAGATTC 
AAGGTTAGTCCCTACCTCTTCAcTGTCCCAATTAAGGAAGCAGGCGAAGA 
CTGCCATGCCCCAACATACCTACCTGCGGAGGTGGATGGTGATGTCAAAC 
TCAGTTCCAATCTGGTGATTCTACCTGGTCAAGATCTCCAATATGTTTTG 
GCAACCTACGATACTTCCAGGGTTGAACATGCTGTGGTTTATTACGTTTA 
CAGCCCAgGCCGCTCATTTTtTTACTTTTATCCTTTTAGGTTGCCTATAA 
AGGGGGTCCCCATCGAATTACAAGTGGAATGCTTCACATGGGACCAAAAA 
CTCTGGTGCCGTCACTTCTGTGTGCTTGCGGACTCAGAATCTGGTGGACA 
TATCACTCACTCTGGGATGGtGGGCATGGGAGTCAGCTGCACAGTCACCC 
GGGAAGATGGAACCAATCGCAGATAG 
[SEQIDNO:97] 
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Fig 42C: F MUV humH M hum.seq check: 5778 from: 14 to: 3139 
Humanised nucleic acids sequence of F MuV H Mv 

ATGAAGGCGTTCCCCGTGATCTGCCTGGGCTTCGCCATCTTCTCCAGCAG 
CATCTGCGTGAACATCAACATCCTGCAGCAGATCGGATACATCAAGCAGC 
AGGTGAGGCAGCTGAGCTACTACTCCCAGAGCTCCAGCTCCTACGTGGTG 

i 

GTCAAGCTGCTGCCCAACATCCAGCCCACCGACAACAGCTGCGAGTTCAA 
GAGCGTGACCCAGTACAACAAGACCCTGAGCAAGCTGCTGCTGCCCATCG 
CCGAGAACATCAACAACATCACCTCCCCCTCCCCCGGCTCCCGGCGGCAC 
AAGCGGTTCGCCGGCATCGCCATCGGCATCGCCGCCCTGGGCGTGGCCAC 
' CGCCGCCCAGGTGACCGCCGCCGTGTCCCTGGTGCAGGCCCAGACCAACG 
CCCGCGCCATCGCCGCCATGAAGAACTCCATCCAGGCCACCAACCGCGCC 
GTGTTCGAGGTGAAGGAGGGCACCCAGCAGCTGGCCATCGCCGTGCAGGC 
CATCCAGGACCACATCAACACCATCATGAACAGCCAGCTGAACAACATGT 
CCTGCCAGATCCTGGACAACCAGGTGGCCACCTCCCTGGGCCTGTACCTG 

ACCGAGCTGACCACCGTGTTCCAGCCCCAGCTGATCAACCCCGCCCTGtc 

if' 

cCCCATCAGTATCCAGGCCCTGCGGTCCCTGCTGGGCAGCATGACCCCCG 
CCGTGGTGCAGGCCACCCTGAGCACCTCCATCAGCGCCGCCGAGATCCTG 
AGCGCCGGCCTGATGGAGGGCCAGATCGTGTCCGTGCTGCTGGACGAGAT 
GCAGATGATCGTGAAGATCAACGTGCCCACCATCGTGACCCAGTCCAACG 
CCCTGGTGATCGACTTCTACAGCATCAGCAGCTTCATCAACAACCAGGAG 
' ' TCCATCATCCAGCTGCCCGACCGCATCCTGGAGATCGGCAACGAGCAGTG 
GCGCTACCCCGCCAAGAACTGCAAGCTGACCCGCCACCACATCTTCTGCC 
AGTACAACGAGGCCGAGCGGCTGAGCCTGGAGACCAAGCTGTGCCTGGCC 
j GGCAACATCAGCGCCTGCGTGTTCTCCAGCATCGCCGGCAGCTACATGCG 
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CCGCTTCGTGGCCCTGGACGGCACCATCGTGGCCAACTGCCGCAGCCTGA 
CCTGCCTGTGCAAGAGCCCCTCCTACCCCATCTACCAGCCCGACCACCAC 
GCCGTGACCACCATCGACCTGACCTCCTGCCAGACCCTGAGCCTGGACGG 
CCTGGACTTCAGCATCGTGTCCCTGAGCAACATCACCTACGCCGAGAACC 
TGACCATCAGCCTGAGCCAGACCATCAACACCCAGCCCATCGACATCTCC 
ACCGAGCTGAGCAAGGTGAACGCCTCCCTGCAGAACGCCGTGAAGTACAT 
CAAGGAGAGCAACCACCAGCTGCAGAGCGTGAGCGTGAGCAGCAAGCGCC 
TGCACCGCGCCGCCATCTACACCGCCGAGATCCACAAGAGCCTGAGCACC 
AACCTGGACGTGACCAACTCCATCGAGCACCAGGTGAAGGACGTGCTGAC 
CCCCCTGTTCAAGATCATCGGCGACGAGGTGGGCCTGCGCACCCCCCAGC 
GCTTCACCGACCTGGTGAAGTTCATCTCCGACAAGATCAAGTTCCTGAAC 

cccgaccgcgagtacgact.tccgcgacctgacctggtgcatcaacccccc 
cgagcggatcaagctggactacgaccagtactgcgccgacgtggccgccg 
aggagctgatgaacgccctggtgaacagcaccctgctggagacccgcacc 
accaaccagttcctggccgtgagcaagggcaactgcagcggccccaccac 
catccggggccagttcagcaacatgagcctgtccctgctggacctgtacc 
tgggccg'gggctacaacgtgagcagcatcgtgaccatgaccagccagggc 
atgtacggcggcacctacctggtggagaagcccaacctgagcagcaagcg 
gagcgagctgagccagctgagcatgtaccgcgtgttcgaggt^ 

TCCGGAACCCCGGCCTGGGCGCCCCCGTGTTCCAGATGACCAACTACCTG 
GAGCAGCCCGTGAGCAACGACCTGAGCAACTGCATGGTGGCCCTGGGCGA 
GCTGAAGCTGGCCGCCCTGTGCCACGGCGAGGACAGCATCACCATCCCCT 
ACCAGGGCAGCGGCAAGGGCGTGAGCTTCCAGCTGGTGAAGCTGGGCGTG 
TGGAAGAGCCCCACCGACATGCAGAGCTGGGTGCCCCTGAGCACCGACGA 
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f 

CCCCGTGATCGACCGCCTGTACCTGAGCAGCCACCGCGGCGTGATCGCCG 
ACAACCAGGCCAAGTGGGCCGTGCCCACCACCCGCACCGACGACAAGCTG 
CGCATGGAGACCTGCTTCCAGCAGGCCTGCAAGGGCAAGATCCAGGCCCT 
GTGCGAGAACCCCGAaTGGGCCCCCCTGAAGGACAACCGCATCCCCAGCT 
ACGGCGTGCTGAGCGTGGACCTGAGCCTGACCGTGGAGCTGAAGATCAAG 

V 

ATCGCGAGCGGCTTCGGCCCCCTGATCACCCACGGCAGCGGCATGGACCT 
GTACAAGAGCAACCACAACAACGTGTACTGGCTGACCATCCCCCCCATGA 
AGAACCTGGCCCTGGGCGTGATCAACACCCTGGAGTGGATtCCCCGCTTC 
AAGGTGAGCCCCTACCTGTTCACCGTGCCCATCAAGGAGGCCGGCGAGGA 
CTGCCACGCCCCGACCTACCTGCCCGCCGAGGTGGACGGCGACGTGAAGC 
TGAGCAGCAACCTGGTGATCCTGCCCGGCCAGGACCTGCAGTACGTGCTG 

GCCACCTACGACACCAGCCGCGTGGAGCACGCCGTGGTGTACTACGTGTA 

f 

CAGCCCCGGCCGCAGCTTCTTGTACTTCTACCCCTTCCGCCTGCCCATCA. 
AGGGCGTGCCCATCGAGCTGCAGGTGGAGTGCTTCACCTGGGACCAGAAG 
CTGTGGTGCCGCCACTTCTGCGTGCTGGCCGACAGCGAGAGCGGCGGCCA 
CATCACCCACAGCGGCATGGTGGGCATGGGCGTGAGCTGCACCGTGACCC 

GCGAGGACGGCACCAACCGCCGCTAG 
[SEQ ID NO: 98] 
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